VII. Mathematical Contributions to the Theory of Evolution. — IV. On the Probable
Errors of Frequency Constants and on the Influence of Random Selection on
Variation and Correlation.

By Karl Pearson, F.R.S., and L. N. G. Filon, B.A., University College, London.

Received October 18, — Read November 25, 1897.

Contents.

I. Introductory
II. On the probable errors of a system of frequency constants. — General Theorem
    On the determination of the probable errors and the error correlations of the
      frequency constants
III. On the probable errors and the coefficients of correlation between errors made in the
      determination of the constants of a normal frequency distribution
    Probable error of a regression coefficient for two organs
    Probable errors of variation and correlation for three organs
    Case of four or more organs
IV. On the probable errors and the coefficients of correlation between errors made in the
      determination of the constants in skew variation

    In the case of the curve $y = y_0 (1 + x/a)^{p} e^{-\gamma x}$
    Numerical illustration: Incidence of enteric fever
    In the case of the curve $y = y_0 (1 + x/a_1)^{m_1} (1 - x/a_2)^{m_2}$
    Numerical illustration: Glands of the forelegs of swine
    In the case of the curve
    Numerical illustration: Stature of children
Conclusion
Note on the probable error of the criterion

I. Introductory.

(1) In earlier memoirs by one of the present authors, methods have been discussed
for the calculation of the constants (a) of variation, normal or skew,* (b) of correlation,
when normal.† The subject of skew correlation would now naturally present
itself, but although several important conclusions with regard to skew correlation
have been worked out, there are still difficulties which impede the completion of the
memoir on that topic. Meanwhile Mr. G. U. Yule has shown that the constants of
normal correlation are significant, if not completely descriptive, even in the case of
skew correlation.‡ It seems desirable to take, somewhat out of its natural order, the
subject of the present memoir, partly because the formulae involved have been once
or twice cited and several times used in memoirs by one of the present writers, and
partly because the need of such formulae seems to have been disregarded by various
authors in somewhat too readily drawing conclusions from statistical data. Differences
in the constants of variation or of correlation have been not infrequently asserted to
be significant or non-significant of class or of type, or of race differences, without a due
investigation of whether those differences are, from the standpoint of mathematical
statistics, greater or less than the probable errors of the differences. Notwithstanding
that every artificial or even random selection of a group out of a community changes
not only the amount of variation, but the amount of correlation of the organs of its
members as compared with those of the primitive group,§ it has been supposed that
correlation might be a racial constant, and the approximate constancy of coefficients
of correlation of the same organs in allied species has been used as a valid argument. In
the like manner differences in variation have been used as an argument for the activity
of natural selection without a discussion of the probable errors of those differences.

In dealing with variation and correlation we find the distribution described by
certain curves or surfaces fully determined when certain constants are known. These
are the so-called constants of variation and correlation, the number of which may
run up from two to a very considerable figure in the case of a complex of organs. If
we deal with a complex of organs in two groups containing, say, $n$ and $n'$ individuals,
we can only ascertain whether there is a significant or insignificant difference between
those groups by measuring the extent to which the differences of corresponding
constants exceed the probable errors of those differences. The probable error of a
difference can at once be found by taking the square root of the sum of the squares
of the probable errors of the quantities forming the difference. Hence the first step
towards determining the significance of a group difference, i.e., towards ascertaining
whether it is really a class, race, or type difference, is to calculate the probable errors
of the constants of variation and correlation of the individual groups. This will be
the object of our first general theorem.

* "Mathematical Contributions to the Theory of Evolution. — II. On Skew Variation," 'Phil. Trans.,'
A, vol. 186, pp. 343-414.

† "Mathematical Contributions to the Theory of Evolution. — III. Regression, Heredity, and
Panmixia," 'Phil. Trans.,' A, vol. 187, pp. 253-318.

‡ "On the Significance of Bravais' Formulae for Skew Correlation," 'Roy. Soc. Proc.,' vol. 60,
pp. 477-489.

§ This will be sufficiently indicated in the latter part of the present memoir, but has been more fully
dealt with in a paper on the "Influence of Selection on Correlation," written, but not yet published.
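The rule for the probable error of a difference stated in the introduction can be sketched in a few lines of Python. This is only an illustration: the function names and the numerical constants are hypothetical, and the rule is applied on the assumption that the two groups are independent samples.

```python
import math


def probable_error_of_difference(pe_first: float, pe_second: float) -> float:
    """Square root of the sum of the squares of the two probable errors."""
    return math.sqrt(pe_first ** 2 + pe_second ** 2)


def difference_in_probable_errors(value_first, pe_first, value_second, pe_second):
    """How many probable errors of the difference separate two constants."""
    pe_diff = probable_error_of_difference(pe_first, pe_second)
    return abs(value_first - value_second) / pe_diff


# Hypothetical correlation coefficients for two groups (illustrative only).
ratio = difference_in_probable_errors(0.50, 0.03, 0.42, 0.04)
print(round(ratio, 2))
```

A ratio of only one or two probable errors would give little ground for asserting a class, race, or type difference.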




II. On the Probable Errors of a System of Frequency Constants. — General Theorem.

(2) Let there be a group of $n$ individuals, for each of whom a complex of $m$ organs
is measured, and let $z\,\delta x_1\,\delta x_2 \ldots \delta x_m$ be the frequency with which individuals having
a complex of organs lying between $x_1, x_1 + \delta x_1$; $x_2, x_2 + \delta x_2$; $\ldots$ $x_m, x_m + \delta x_m$, occur
in the total group of $n$. Here $x$ shall measure the deviation of any organ from the
mean of all like organs in the group. Let $h_1, h_2, h_3, \ldots h_m$ be the mean measurements
on the organs, so that $h_1 + x_1, h_2 + x_2, \ldots h_m + x_m$ is the system giving the actual
measurements on any individual. Then $h_1, h_2, h_3, \ldots h_m$ are the first set of constants
of the frequency; they determine the "origin" of the frequency surface.

Let this frequency surface be given by

$$z = f(x_1, x_2, x_3, \ldots x_m;\ c_1, c_2, c_3, \ldots c_p),$$

where $c_1, c_2, c_3, \ldots c_p$ are the $p$ frequency constants, which define the form as
distinguished from the position of the frequency surface, and which will be functions of
standard deviations, moments, skewnesses, coefficients of correlation, &c., &c., of
individual organs, and of pairs of organs in the complex.

The problem before us is to find the probable errors of the $h$'s and the $c$'s, which
constants fully determine the position and shape of the frequency surface. Let
$\sigma_{h_1}, \sigma_{h_2}, \ldots \sigma_{h_m}, \sigma_{c_1}, \sigma_{c_2}, \ldots \sigma_{c_p}$ be the standard deviations of the quantities
$h_1, h_2, \ldots h_m$ and $c_1, c_2, \ldots c_p$. Then a knowledge of these standard deviations will
give us at once the probable errors of the frequency constants, for we have only to
multiply the former by the numerical factor 0.6745 to obtain the latter.
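The numerical factor can be checked directly. The sketch below assumes the usual definition of the probable error as the deviation within which half of a normal distribution of errors falls; the helper name `probable_error` is hypothetical.

```python
from statistics import NormalDist

# Half of a normal distribution lies within +/- one probable error of the mean,
# so the factor is the 75th percentile of the standard normal curve.
PROBABLE_ERROR_FACTOR = NormalDist().inv_cdf(0.75)


def probable_error(standard_deviation: float) -> float:
    """Convert a standard deviation into the corresponding probable error."""
    return PROBABLE_ERROR_FACTOR * standard_deviation


print(round(PROBABLE_ERROR_FACTOR, 5))
```

To five decimals the factor agrees with the value .67449 used later in the memoir.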

Let us now suppose that the value of the frequency constants had been $h_1 + \Delta h_1$,
$h_2 + \Delta h_2$, $\ldots$ $h_m + \Delta h_m$, $c_1 + \Delta c_1$, $c_2 + \Delta c_2$, $\ldots$ $c_p + \Delta c_p$, instead of the observed
values.

Then the frequency of any observed individual would have varied as

$$f(x_1 + \Delta h_1, x_2 + \Delta h_2, \ldots x_m + \Delta h_m;\ c_1 + \Delta c_1, c_2 + \Delta c_2, \ldots c_p + \Delta c_p)$$

instead of as

$$f(x_1, x_2, \ldots x_m;\ c_1, c_2, \ldots c_p).$$

Hence on this hypothesis the probability, $P_\Delta$, of the set of individuals observed in
the group actually occurring is to the probability, $P_0$, of the set occurring when the
constants are $h_1, h_2, \ldots h_m$; $c_1, c_2, \ldots c_p$, in the ratio of the product of all quantities
like $f(x_1 + \Delta h_1, x_2 + \Delta h_2, \ldots x_m + \Delta h_m;\ c_1 + \Delta c_1, c_2 + \Delta c_2, \ldots c_p + \Delta c_p)$ for all
values of $x_1, x_2, \ldots x_m$, to the like product of all quantities like
$f(x_1, x_2, \ldots x_m;\ c_1, c_2, \ldots c_p)$, or

$$\frac{P_\Delta}{P_0} = \frac{\Pi\, f(x_1 + \Delta h_1, x_2 + \Delta h_2, \ldots x_m + \Delta h_m;\ c_1 + \Delta c_1, c_2 + \Delta c_2, \ldots c_p + \Delta c_p)}{\Pi\, f(x_1, x_2, \ldots x_m;\ c_1, c_2, \ldots c_p)}.$$




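Taking logarithms, the products $\Pi$ become sums $S$, or

$$\log (P_\Delta/P_0) = S \log f(x_1 + \Delta h_1, x_2 + \Delta h_2, \ldots x_m + \Delta h_m;\ c_1 + \Delta c_1, c_2 + \Delta c_2, \ldots c_p + \Delta c_p) - S \log f(x_1, x_2, \ldots x_m;\ c_1, c_2, \ldots c_p).$$

Let the first summation now be expanded by Taylor's theorem, and typical terms
up to the second order be written down. Then we have

$$\log (P_\Delta/P_0) = \Delta h_r\, S \frac{d}{dx_r}(\log f) + \tfrac{1}{2} (\Delta h_r)^2\, S \frac{d^2}{dx_r^2}(\log f) + \Delta h_r\, \Delta h_{r'}\, S \frac{d^2}{dx_r\, dx_{r'}}(\log f)$$
$$\qquad + \Delta c_s\, S \frac{d}{dc_s}(\log f) + \tfrac{1}{2} (\Delta c_s)^2\, S \frac{d^2}{dc_s^2}(\log f) + \Delta c_s\, \Delta c_{s'}\, S \frac{d^2}{dc_s\, dc_{s'}}(\log f)$$
$$\qquad + \Delta h_r\, \Delta c_s\, S \frac{d^2}{dx_r\, dc_s}(\log f) + \ldots + \text{cubic terms in } \Delta h \text{ and } \Delta c + \text{\&c.},$$

where $f$ stands for $f(x_1, x_2, x_3, \ldots x_m;\ c_1, c_2, \ldots c_p)$.

Here $r$ is to be given every value from 1 to $m$, and $s$ to be given every value from
1 to $p$, but $r'$ and $s'$ in the third and sixth sums are only to be given values from 1
to $m$ and 1 to $p$ other than $r$ and $s$ respectively. In the above formula we may
replace the sums by integrals, if we remember that the frequency of the system
$x_1, x_2, \ldots x_m$ is simply $f\, \delta x_1\, \delta x_2 \ldots \delta x_m$.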

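Writing

$$\log (P_\Delta/P_0) = A_r\, \Delta h_r - \tfrac{1}{2} B_r (\Delta h_r)^2 + C_{rr'}\, \Delta h_r\, \Delta h_{r'} + D_s\, \Delta c_s - \tfrac{1}{2} E_s (\Delta c_s)^2 + F_{ss'}\, \Delta c_s\, \Delta c_{s'} + G_{rs}\, \Delta h_r\, \Delta c_s + \text{\&c.} \ldots,$$

we will investigate the values of the constants separately.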
First,

$$A_r = \iiint \ldots \frac{d(\log f)}{dx_r}\, f\, dx_1\, dx_2 \ldots dx_m
      = \iiint \ldots \frac{df}{dx_r}\, dx_1\, dx_2 \ldots dx_m
      = \iint \ldots [f]\, dx_1\, dx_2 \ldots dx_{r-1}\, dx_{r+1} \ldots dx_m,$$

the integrals now not including one with regard to $dx_r$, and $[f]$ denoting that $f$ is to
be taken between the extreme limits of $f$ for $x_r$. Now in most cases of frequency
the frequencies for extreme values of any organ are zero.* Hence $[f]$ equals
nothing. Thus we have $A_r = 0$.

* In most cases, but not invariably, as, for example, in the case of some florets and petals. In the
cases, however, in which $A_r$ does not vanish, the conclusions finally reached will be the same, as $A_r$ only
marks a change of origin for the constant frequency distribution.






Secondly,

$$D_s = \iiint \ldots \frac{d(\log f)}{dc_s}\, f\, dx_1\, dx_2 \ldots dx_m
      = \iiint \ldots \frac{df}{dc_s}\, dx_1\, dx_2 \ldots dx_m
      = \frac{d}{dc_s} \iiint \ldots f\, dx_1\, dx_2 \ldots dx_m
      = \frac{dn}{dc_s},$$

where $n$ is the total number of individuals measured, which is independent of $c_s$.
Therefore $D_s = 0$.
Thirdly,

$$C_{rr'} = \iiint \ldots f\, \frac{d^2 (\log f)}{dx_r\, dx_{r'}}\, dx_1\, dx_2 \ldots dx_m \qquad \text{(i.)}.$$

This will not as a rule vanish. If the frequency be normal, it will still not as a rule
vanish. It will vanish either if the frequency be symmetrical about $x_r = 0$, and
$x_{r'} = 0$, or if there be no correlation between the $x_r$ and $x_{r'}$ organs, i.e., if $f$ be of the
form $f_1(x_r) \times f_2(x_{r'})$.
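Fourthly,

$$G_{rs} = \iiint \ldots f\, \frac{d^2 (\log f)}{dx_r\, dc_s}\, dx_1\, dx_2 \ldots dx_m \qquad \text{(ii.)}.$$

This will not as a rule vanish. It will vanish, however, if the frequency of $x_r$ be
symmetrical about its mean.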
Fifthly,

$$B_r = -\iiint \ldots f\, \frac{d^2 (\log f)}{dx_r^2}\, dx_1\, dx_2 \ldots dx_m \qquad \text{(iii.)},$$

$$E_s = -\iiint \ldots f\, \frac{d^2 (\log f)}{dc_s^2}\, dx_1\, dx_2 \ldots dx_m \qquad \text{(iv.)},$$

$$F_{ss'} = \iiint \ldots f\, \frac{d^2 (\log f)}{dc_s\, dc_{s'}}\, dx_1\, dx_2 \ldots dx_m \qquad \text{(v.)},$$

all of which will generally be finite, but admit, like $C_{rr'}$ and $G_{rs}$, of calculation when
the form of the frequency $f$ is given. Hence

$$P_\Delta = P_0 \exp\left[-\tfrac{1}{2}\left\{B_r (\Delta h_r)^2 - 2 C_{rr'}\, \Delta h_r\, \Delta h_{r'} - 2 G_{rs}\, \Delta h_r\, \Delta c_s + E_s (\Delta c_s)^2 - 2 F_{ss'}\, \Delta c_s\, \Delta c_{s'} + \text{\&c.} \ldots \right\}\right]$$
$$\qquad \times \exp(\text{terms in cubic and higher orders of the } \Delta\text{'s}) \qquad \text{(vi.)}.$$

This represents the probability of the observed unit, i.e., the individuals
($x_1, x_2, \ldots x_m$, for all sets), occurring, on the assumption that errors $\Delta h_1, \Delta h_2, \ldots$
$\Delta h_m$, $\Delta c_1, \Delta c_2, \ldots \Delta c_p$ have been made in the determination of the frequency
constants. In other words, we have here the frequency distribution for errors in the
values of the frequency constants.
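As a rough numerical illustration of (vi.), the following sketch compares the exact value of log (P_Δ/P_0) with its square terms for a single organ following a normal curve. The measurements are hypothetical, the sums S over the observed individuals are used in place of the integrals, and the values B = n/σ² and E = 2n/σ² are what those sums give for this particular curve (an assumption of the sketch, not a formula quoted from the text).

```python
import math


def log_likelihood(deviations, shift, sigma):
    """S log f for a normal curve whose mean is displaced by `shift` from the
    group mean and whose standard deviation is `sigma`."""
    return sum(
        -math.log(sigma) - 0.5 * math.log(2 * math.pi) - (x - shift) ** 2 / (2 * sigma ** 2)
        for x in deviations
    )


# Hypothetical measurements of one organ (illustrative only).
measurements = [4.1, 5.3, 4.8, 6.0, 5.5, 4.4, 5.1, 5.9, 4.7, 5.2]
n = len(measurements)
mean = sum(measurements) / n
deviations = [m - mean for m in measurements]
sigma = math.sqrt(sum(x * x for x in deviations) / n)

log_p0 = log_likelihood(deviations, 0.0, sigma)


def exact_and_quadratic(d_mean, d_sigma):
    """Return (exact log ratio, square terms only) for errors d_mean, d_sigma."""
    exact = log_likelihood(deviations, d_mean, sigma + d_sigma) - log_p0
    # Square terms: B = n / sigma^2 for the mean, E = 2n / sigma^2 for sigma.
    quadratic = (-0.5 * (n / sigma ** 2) * d_mean ** 2
                 - 0.5 * (2 * n / sigma ** 2) * d_sigma ** 2)
    return exact, quadratic


print(exact_and_quadratic(0.1, 0.0))           # error in the mean only
print(exact_and_quadratic(0.0, 0.02 * sigma))  # error of 2 per cent. in sigma
```

For an error in the mean alone the two values coincide; for an error in the standard deviation they differ by a few per cent., which is the contribution of the cubic and higher terms.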

(3) Conclusions to be drawn from the form of (vi.).

(α) The distribution of the errors of frequency constants, if treated exactly, will
generally be skew, for the cubic and higher terms in the $\Delta$'s do not vanish. If,
however, the cubic terms are small as compared with the square terms, the frequency
distribution of errors will approximate closely to a normal correlation surface.

It would be impossible to evaluate the remainder after the second power terms in
Taylor's series for any general expression $f(x_1, x_2, \ldots x_m;\ c_1, c_2, \ldots c_p)$ for frequency.
In special cases we have found that terms of the third order amount in the most
unfavourable circumstances to 4 per cent. of the terms of the second order, generally to
a good deal less.* Probably the series in most cases converges with considerable
rapidity. The fact, however, that we are dealing with the first terms of a series should
be borne in mind. It does not seem to have been sufficiently emphasised when the
probable error of the standard deviation is taken to be $67.449/\sqrt{2n}$ per cent. of the
standard deviation. The usual proof of this result, however, involves the same
assumption as to the smallness of the cubic terms.

(β) Supposing the errors so small that we may neglect the cubic terms, we
conclude that the errors made in calculating the constants of any frequency
distribution are

(i.) Themselves distributed according to the normal law of errors,
(ii.) Correlated among themselves.

Both these conclusions are of the utmost importance. The first enables us to
obtain the probable errors of the frequency constants; the second depends upon the
fact that $C_{rr'}$, $G_{rs}$, and $F_{ss'}$ are in general not zero. The standard deviations of, and
the correlations between, the frequency constant errors can now be calculated by the
ordinary theory of normal correlation.

Before, however, proceeding to these calculations, we may draw one or two other
conclusions of considerable generality and wide significance.

* See, for the relative order of two terms of the second and third orders, "Regression, Heredity,
and Panmixia," 'Phil. Trans.,' A, vol. 187, p. 266.

(γ) Consider a race fully defined by the variations $\sigma_1, \sigma_2, \sigma_3, \ldots$, &c. of the organs
of its members and their correlations $r_{12}, r_{13}, r_{23}, \ldots$ Now let a random small
selection be made of this race, defined by

$$\sigma_1 + \delta\sigma_1,\ \sigma_2 + \delta\sigma_2,\ \sigma_3 + \delta\sigma_3,\ \ldots\ r_{12} + \delta r_{12},\ r_{13} + \delta r_{13},\ r_{23} + \delta r_{23},\ \ldots,$$

where the magnitudes of $\delta\sigma_1, \delta\sigma_2, \delta\sigma_3, \ldots \delta r_{12}, \delta r_{13}, \delta r_{23}, \ldots$, are quantities depending
on the magnitude of the probable errors of the variation constants, and therefore on
the size of the selection. Then the system $\delta\sigma_1, \delta\sigma_2, \delta\sigma_3, \ldots \delta r_{12}, \delta r_{13}, \delta r_{23}, \ldots$, is not
a system of independent variations, but the changes in variation and correlation are
correlated together, since the terms $F_{ss'}$ do not generally vanish. We therefore
conclude that if a random selection be made out of a population with regard to one
organ $x_1$, there will be tendencies for the variation of all other organs and the
correlation between all organs to change also in certain directions, which can be
definitely indicated so soon as the general population has been measured, and the
effect of the random selection on one organ has been ascertained. What is proved
here of random selection will be shown to be true still more intensively for artificial
selection,* i.e., every selection of one organ modifies in a correlated manner the
variation and correlation of all other organs. It is impossible to alter one organ
without altering all other organs and their relation to each other.

(δ) The remarks in (γ) are not only true for a random selection of variation, but
equally well apply to selection of size. This follows because the terms $C_{rr'}$ and $G_{rs}$ do
not as a rule vanish. If a random selection of 100 or 1,000 individuals be made out
of a general population, then the mean size of any organ in this sub-group will
probably differ from that of the general population. The result is that the size of
all other organs, their amounts of variation and correlation, will probably have values
differing in definite directions from those of the general population.

(ε) Take two random selections out of a general population; the probability is that
they will have different means for any one organ, and a result will be that they will
have correlated systems of changes in the sizes, variations, and correlations of all
other organs. In other words, random selection produces a differentiation of all
characters, which differentiation will be the more marked the smaller the random
selection.†

(ζ) This principle, that a random selection gives a system of correlated changes in
the deviations of all the characters of a species, seems, to some extent, capable of
explaining the small but systematic differences to be found occasionally between
closely allied species. It is not necessary to suppose them due to a long process of
natural selection acting on a variety of organs; a small random selection, or possibly
a natural selection of one organ, might suffice to produce the systematic differences of
character in all organs.

How far a succession of random selections would give an evolution, biassed by the
first random selection, requires further consideration; but it seems impossible for the
characters of a race to remain fixed under the influence of a heavy but non-selective
death-rate. They will vary from year to year, although this systematic change of
characters be not always in the same direction. Systematic change of characters
produced by random selection may be spoken of as random evolution. Random
evolution is theoretically a possible cause of systematic change; experiment only can
determine how great is its effectiveness in differentiating local races.

* "Memoir on the Influence of Selection on Correlation." Selection not only modifies correlation,
but the selection of one organ can create correlation between organs previously uncorrelated.

† Continuous artificial selection of an organ produces a still more marked differentiation of all other
characters, but this is treated of in another memoir.

(η) In the case of a normal distribution of variation defined by the mean $h$ and
the standard deviation $\sigma$, it has been usual to suppose that the error made in the
mean is independent of the error made in the variation. In other words, it has
been assumed that $G_{rs}$ vanishes, although no proof has been given, or possibly it
has not been realised that a proof was necessary. In this case $f$ is of the form

$$f = \frac{n}{\sqrt{2\pi}\, \sigma} \exp\left(-\frac{x^2}{2\sigma^2}\right) \quad\text{and}\quad \frac{d^2 (\log f)}{dx\, d\sigma} = \frac{2x}{\sigma^3}, \quad\text{whence}$$

$$G_{rs} = \int_{-\infty}^{+\infty} \frac{n}{\sqrt{2\pi}\, \sigma} \exp\left(-\frac{x^2}{2\sigma^2}\right) \frac{2x}{\sigma^3}\, dx = 0.$$

Thus there is no correlation between error in the mean and error in the standard
deviation. This assumes that we stop at the square terms in (vi.). If, however, we
include the cubic terms, &c., product terms in $\Delta h$ and $\Delta\sigma$ do arise, and we cannot
state straight off that no correlation exists, although it may be very small. In the
case of all skew variation, such as is so frequent among plants and animals, a
correlation will always be found between deviation or error in the mean and the like in
the standard deviation. In other words, to alter the mean by selection (artificial or
random) is to alter the variation of an organ.

With the exception of the statements in this paragraph (η), the whole of our
general conclusions in this section are independent of any particular law of frequency.
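The contrast drawn in (η) between normal and skew variation can be illustrated by a small simulation. This is a sketch under stated assumptions: random groups are drawn from a normal population and from an exponential population (chosen here only as a convenient skew curve, not one used in the memoir), and the correlation between the group mean and the group standard deviation is estimated. The function names, group size, number of groups, and seed are all arbitrary.

```python
import math
import random


def pearson_r(xs, ys):
    """Product-moment coefficient of correlation between two lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def mean_sd_error_correlation(draw, group_size=50, groups=2000, seed=1898):
    """Correlation between mean and standard deviation over random groups."""
    rng = random.Random(seed)
    means, sds = [], []
    for _ in range(groups):
        sample = [draw(rng) for _ in range(group_size)]
        m = sum(sample) / group_size
        s = math.sqrt(sum((x - m) ** 2 for x in sample) / group_size)
        means.append(m)
        sds.append(s)
    return pearson_r(means, sds)


normal_corr = mean_sd_error_correlation(lambda rng: rng.gauss(0.0, 1.0))
skew_corr = mean_sd_error_correlation(lambda rng: rng.expovariate(1.0))
print(round(normal_corr, 2), round(skew_corr, 2))
```

For the normal population the estimated correlation should be close to zero, while for the skew population it should be markedly positive, in line with the statement that altering the mean by selection alters the variation.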

(4) On the Determination of the Probable Errors and the Error Correlations of the
Frequency Constants.

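Let $\eta_1, \eta_2, \eta_3, \ldots$, be the frequency constants, whether they be the means, standard
deviations, or correlations of a complex of organs. Then if we neglect cubic and higher
terms in the deviations $\Delta\eta_1, \Delta\eta_2, \Delta\eta_3, \ldots$, the frequency surface giving the
distribution of the variations in the deviations is

$$P_\Delta = P_0 \exp\left[-\tfrac{1}{2}\left\{ S\left(a_{rr} (\Delta\eta_r)^2\right) - 2 S\left(a_{rs}\, \Delta\eta_r\, \Delta\eta_s\right) \right\}\right],$$

where

$$a_{rr} = -\iiint \ldots f\, \frac{d^2 (\log f)}{d\eta_r^2}\, dx_1\, dx_2 \ldots dx_m, \qquad
  a_{rs} = \iiint \ldots f\, \frac{d^2 (\log f)}{d\eta_r\, d\eta_s}\, dx_1\, dx_2 \ldots dx_m \qquad \text{(vii.)}.$$

It is required to find $\Sigma_r$, the standard deviation of $\Delta\eta_r$, and $R_{rs}$, the coefficient of
correlation between $\Delta\eta_r$ and $\Delta\eta_s$.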



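Let $\Delta$ be the discriminant:

$$\Delta = \begin{vmatrix}
a_{11}, & -a_{12}, & -a_{13}, & \ldots \\
-a_{21}, & a_{22}, & -a_{23}, & \ldots \\
-a_{31}, & -a_{32}, & a_{33}, & \ldots \\
\ldots & \ldots & \ldots & \ldots
\end{vmatrix},$$

where $a_{rs} = a_{sr}$. Further, let $A_{rs}$ be the minor of the $r$th row and $s$th column; then

$$\Sigma_r^2 = A_{rr}/\Delta \quad\text{and}\quad R_{rs} = A_{rs}/(\Delta\, \Sigma_r \Sigma_s) \qquad \text{(viii.)}$$

give the required values of $\Sigma_r$ and $R_{rs}$.*

Further, the standard deviation of $\Delta\eta_r$ for selected values of all the other $\Delta\eta$'s
is $1/\sqrt{a_{rr}}$.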

This value can often be of service. Thus suppose a considerable number of
skeletons found, but that only in a comparatively few cases is it possible to pair
together the femur and tibia of the same individuals. Then the variations of femur
and tibia in the race will be known with great exactness, but the probable error of
the correlation between femur and tibia will be given with close approximation by a
form like $.67449 \times 1/\sqrt{a_{rr}}$. In fact, whenever we have obtained from a large number of
observations the values of the frequency constants of a race with great exactness,
then, using these values to obtain an additional variation or correlation constant from
a few observations, the probable error will be of the form just indicated, and not of
the form $.67449 \sqrt{A_{rr}/\Delta}$. It is needless, perhaps, to remark that the former is far easier
to calculate than the latter.
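The rule (viii.) and the distinction between the two forms of probable error can be sketched numerically for three constants. The matrix below is hypothetical; it is taken to be the matrix of the quadratic form in the exponent, with the $a_{rr}$ on the diagonal and $-a_{rs}$ off it, and "minor" is read as the signed minor (cofactor). Both readings are assumptions of this sketch.

```python
import math


def determinant3(m):
    """Determinant of a 3 x 3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def cofactor3(m, r, s):
    """Signed minor of row r and column s (indices counted from 0)."""
    rows = [i for i in range(3) if i != r]
    cols = [j for j in range(3) if j != s]
    minor = (m[rows[0]][cols[0]] * m[rows[1]][cols[1]]
             - m[rows[0]][cols[1]] * m[rows[1]][cols[0]])
    return (-1) ** (r + s) * minor


def error_sds_and_correlations(q):
    """Standard deviations and correlations of the errors by rule (viii.)."""
    delta = determinant3(q)
    sds = [math.sqrt(cofactor3(q, r, r) / delta) for r in range(3)]
    corr = [[cofactor3(q, r, s) / (delta * sds[r] * sds[s]) for s in range(3)]
            for r in range(3)]
    return sds, corr


# Hypothetical quadratic-form matrix: a_rr on the diagonal, -a_rs off it.
q = [[4.0, -1.0, -0.5],
     [-1.0, 3.0, -0.8],
     [-0.5, -0.8, 2.0]]

absolute_sds, correlations = error_sds_and_correlations(q)
partial_sds = [1.0 / math.sqrt(q[r][r]) for r in range(3)]  # others known exactly

for r in range(3):
    print(round(partial_sds[r], 4), round(absolute_sds[r], 4))
```

In each row the first figure, which supposes the other constants exactly known, does not exceed the second; multiplying either by .67449 gives the corresponding probable error.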



III. On the Probable Errors and the Coefficients of Correlation between
Errors made in the Determination of the Constants of a Normal
Frequency Distribution.

(5) In order to exhibit more clearly the method of investigation, it is desirable
that a simple case be first taken. Accordingly we will start with the following
problem:

To find the Probable Errors and Error Correlations of the Constants of a Normal
Frequency Distribution for Two Organs.

Let $h_1, h_2$ be the means, $\sigma_1, \sigma_2$ the standard deviations, $r_{12}$ the coefficient of
correlation, $n$ the total number of pairs of observations, $x_1, x_2$ any pair of corresponding
deviations of the organs. Then the frequency $z\, dx_1\, dx_2$ is given by the surface

$$z = \frac{n}{2\pi \sigma_1 \sigma_2 \sqrt{1 - r_{12}^2}} \exp\left[-\tfrac{1}{2}\left\{ \frac{x_1^2}{\sigma_1^2 (1 - r_{12}^2)} - \frac{2 r_{12}\, x_1 x_2}{\sigma_1 \sigma_2 (1 - r_{12}^2)} + \frac{x_2^2}{\sigma_2^2 (1 - r_{12}^2)} \right\}\right] \qquad \text{(ix.)}.$$

* See "Regression, Panmixia, and Heredity," 'Phil. Trans.,' A, vol. 187, p. 301 (e).

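We require to find

$\Sigma_{\sigma_1}$, the standard deviation of errors in $\sigma_1$,
$\Sigma_{\sigma_2}$, the standard deviation of errors in $\sigma_2$,
$\Sigma_{r_{12}}$, the standard deviation of errors in $r_{12}$,
the standard deviation of errors in the coefficient of regression,*
$R_{\sigma_1 \sigma_2}$, the coefficient of correlation between errors in $\sigma_1$ and $\sigma_2$,
$R_{\sigma_1 r_{12}}$, the coefficient of correlation between errors in $\sigma_1$ and $r_{12}$,
$R_{\sigma_2 r_{12}}$, the coefficient of correlation between errors in $\sigma_2$ and $r_{12}$,
$\Sigma_{h_1}$, the standard deviation of errors in $h_1$,
$\Sigma_{h_2}$, the standard deviation of errors in $h_2$,
$R_{h_1 h_2}$, the coefficient of correlation between errors in $h_1$ and $h_2$.

It follows that $R_{h_1 \sigma_1}$, $R_{h_1 \sigma_2}$, $R_{h_2 \sigma_1}$, $R_{h_2 \sigma_2}$, $R_{h_1 r_{12}}$, $R_{h_2 r_{12}}$ are all zero, since, by (ii.) of p. 233,
$G_{rs}$ will vanish when the distributions of $x_1$ and $x_2$ are symmetrical. These
correlations would not, however, vanish for the skew frequency distributions, which are of
most frequent occurrence in problems of heredity and fertility in man, &c.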

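The first stage in the investigation is to write down the second differentials of the
logarithm of $z$ for all quantities occurring in it.

We find

$$\frac{d^2 (\log z)}{d\sigma_1^2} = \frac{1}{\sigma_1^2} - \frac{3 x_1^2}{\sigma_1^4 (1 - r_{12}^2)} + \frac{2 r_{12}\, x_1 x_2}{\sigma_1^3 \sigma_2 (1 - r_{12}^2)},$$

$$\frac{d^2 (\log z)}{d\sigma_2^2} = \frac{1}{\sigma_2^2} - \frac{3 x_2^2}{\sigma_2^4 (1 - r_{12}^2)} + \frac{2 r_{12}\, x_1 x_2}{\sigma_1 \sigma_2^3 (1 - r_{12}^2)},$$

$$\frac{d^2 (\log z)}{d\sigma_1\, d\sigma_2} = \frac{r_{12}\, x_1 x_2}{\sigma_1^2 \sigma_2^2 (1 - r_{12}^2)},$$

$$\frac{d^2 (\log z)}{d\sigma_1\, dr_{12}} = \frac{1}{\sigma_1}\left\{ \frac{2 r_{12}\, x_1^2}{\sigma_1^2 (1 - r_{12}^2)^2} - \frac{x_1 x_2 (1 + r_{12}^2)}{\sigma_1 \sigma_2 (1 - r_{12}^2)^2} \right\},$$

$$\frac{d^2 (\log z)}{d\sigma_2\, dr_{12}} = \frac{1}{\sigma_2}\left\{ \frac{2 r_{12}\, x_2^2}{\sigma_2^2 (1 - r_{12}^2)^2} - \frac{x_1 x_2 (1 + r_{12}^2)}{\sigma_1 \sigma_2 (1 - r_{12}^2)^2} \right\},$$

$$\frac{d^2 (\log z)}{dr_{12}^2} = \frac{1 + r_{12}^2}{(1 - r_{12}^2)^2} - \frac{1 + 3 r_{12}^2}{(1 - r_{12}^2)^3}\left( \frac{x_1^2}{\sigma_1^2} + \frac{x_2^2}{\sigma_2^2} \right) + \frac{6 r_{12} + 2 r_{12}^3}{(1 - r_{12}^2)^3}\, \frac{x_1 x_2}{\sigma_1 \sigma_2},$$

$$\frac{d^2 (\log z)}{dx_1^2} = -\frac{1}{\sigma_1^2 (1 - r_{12}^2)},$$

$$\frac{d^2 (\log z)}{dx_2^2} = -\frac{1}{\sigma_2^2 (1 - r_{12}^2)},$$

$$\frac{d^2 (\log z)}{dx_1\, dx_2} = \frac{r_{12}}{\sigma_1 \sigma_2 (1 - r_{12}^2)}.$$

* See the memoir on "Heredity, Regression, and Panmixia," p. 268.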



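Now these must be multiplied by $z$ and integrated for $x_1$ and $x_2$ from $-\infty$ to $+\infty$.
These integrations follow at once, if we remember that

$$n\sigma_1^2 = \iint z\, x_1^2\, dx_1\, dx_2, \qquad n\sigma_2^2 = \iint z\, x_2^2\, dx_1\, dx_2, \qquad n\sigma_1 \sigma_2 r_{12} = \iint z\, x_1 x_2\, dx_1\, dx_2,$$

by definition of $\sigma_1$, $\sigma_2$, and $r_{12}$.

Thus we obtain the following system:

$$a_{11} = \frac{n (2 - r_{12}^2)}{\sigma_1^2 (1 - r_{12}^2)}, \qquad a_{22} = \frac{n (2 - r_{12}^2)}{\sigma_2^2 (1 - r_{12}^2)}, \qquad a_{33} = \frac{n (1 + r_{12}^2)}{(1 - r_{12}^2)^2},$$

$$a_{12} = \frac{n r_{12}^2}{\sigma_1 \sigma_2 (1 - r_{12}^2)}, \qquad a_{13} = \frac{n r_{12}}{\sigma_1 (1 - r_{12}^2)}, \qquad a_{23} = \frac{n r_{12}}{\sigma_2 (1 - r_{12}^2)},$$

$$a_{44} = \frac{n}{\sigma_1^2 (1 - r_{12}^2)}, \qquad a_{55} = \frac{n}{\sigma_2^2 (1 - r_{12}^2)}, \qquad a_{45} = \frac{n r_{12}}{\sigma_1 \sigma_2 (1 - r_{12}^2)} \qquad \text{(x.)},$$

where the $a$'s are those of Eqn. (vii.), p. 236, obtained by taking the above differentials
of the logarithms of $z$ in order.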
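The integrations leading to the system (x.) can be checked numerically. The sketch below substitutes the mean values of $x_1^2$, $x_2^2$ and $x_1 x_2$ into the second differentials of log $z$ for the bivariate normal surface, applies the sign convention of (vii.), and compares the results with the closed forms. The numerical values of $n$, $\sigma_1$, $\sigma_2$, $r_{12}$ and the function names are hypothetical.

```python
import math


def a_system_from_differentials(n, s1, s2, r):
    """Integrate the second differentials of log z using the mean values
    x1^2 -> s1^2, x2^2 -> s2^2, x1*x2 -> r*s1*s2 (per individual)."""
    m11, m22, m12 = s1 ** 2, s2 ** 2, r * s1 * s2
    k = 1.0 - r ** 2
    d_s1s1 = 1 / s1 ** 2 - 3 * m11 / (s1 ** 4 * k) + 2 * r * m12 / (s1 ** 3 * s2 * k)
    d_s2s2 = 1 / s2 ** 2 - 3 * m22 / (s2 ** 4 * k) + 2 * r * m12 / (s1 * s2 ** 3 * k)
    d_s1s2 = r * m12 / (s1 ** 2 * s2 ** 2 * k)
    d_s1r = (2 * r * m11 / (s1 ** 2 * k ** 2) - (1 + r ** 2) * m12 / (s1 * s2 * k ** 2)) / s1
    d_s2r = (2 * r * m22 / (s2 ** 2 * k ** 2) - (1 + r ** 2) * m12 / (s1 * s2 * k ** 2)) / s2
    d_rr = ((1 + r ** 2) / k ** 2
            - (1 + 3 * r ** 2) / k ** 3 * (m11 / s1 ** 2 + m22 / s2 ** 2)
            + (6 * r + 2 * r ** 3) / k ** 3 * m12 / (s1 * s2))
    # Convention of (vii.): a_rr carries a minus sign, a_rs does not.
    return {"a11": -n * d_s1s1, "a22": -n * d_s2s2, "a33": -n * d_rr,
            "a12": n * d_s1s2, "a13": n * d_s1r, "a23": n * d_s2r}


def a_system_closed_form(n, s1, s2, r):
    """The variation and correlation entries of system (x.)."""
    k = 1.0 - r ** 2
    return {"a11": n * (2 - r ** 2) / (s1 ** 2 * k),
            "a22": n * (2 - r ** 2) / (s2 ** 2 * k),
            "a33": n * (1 + r ** 2) / k ** 2,
            "a12": n * r ** 2 / (s1 * s2 * k),
            "a13": n * r / (s1 * k),
            "a23": n * r / (s2 * k)}


# Hypothetical constants (illustrative only).
integrated = a_system_from_differentials(100, 2.0, 3.0, 0.5)
closed = a_system_closed_form(100, 2.0, 3.0, 0.5)
for key in sorted(closed):
    print(key, round(integrated[key], 4), round(closed[key], 4))
```

The two columns printed for each constant should agree, which is a useful guard against slips of sign in the differentials.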

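We can now write down the correlation surface, giving the frequencies of errors in
the constants:

$$P_\Delta = P_0 \exp\left[-\tfrac{1}{2}\left\{ \frac{n}{\sigma_1^2 (1 - r_{12}^2)} (\Delta h_1)^2 - \frac{2 n r_{12}}{\sigma_1 \sigma_2 (1 - r_{12}^2)}\, \Delta h_1\, \Delta h_2 + \frac{n}{\sigma_2^2 (1 - r_{12}^2)} (\Delta h_2)^2 \right\}\right]$$
$$\times \exp\left[-\tfrac{1}{2}\left\{ \frac{n (2 - r_{12}^2)}{\sigma_1^2 (1 - r_{12}^2)} (\Delta\sigma_1)^2 + \frac{n (2 - r_{12}^2)}{\sigma_2^2 (1 - r_{12}^2)} (\Delta\sigma_2)^2 + \frac{n (1 + r_{12}^2)}{(1 - r_{12}^2)^2} (\Delta r_{12})^2 \right.\right.$$
$$\left.\left. \qquad - \frac{2 n r_{12}}{\sigma_2 (1 - r_{12}^2)}\, \Delta\sigma_2\, \Delta r_{12} - \frac{2 n r_{12}}{\sigma_1 (1 - r_{12}^2)}\, \Delta\sigma_1\, \Delta r_{12} - \frac{2 n r_{12}^2}{\sigma_1 \sigma_2 (1 - r_{12}^2)}\, \Delta\sigma_1\, \Delta\sigma_2 \right\}\right] \qquad \text{(xi.)}.$$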

Now several important conclusions follow at once from this result:

(α) For the case of normal frequency (but in general only for this case) the errors
in the means are uncorrelated with the errors in the variations and correlations.
The error correlation surface breaks up into two parts, of which the first part we
have written down involves only the means, and would coincide exactly with the
correlation surface (ix.), with which we started, if we write in (ix.) $\Delta h_1$ for $x_1$, $\Delta h_2$ for
$x_2$, and $\sigma_1/\sqrt{n}$, $\sigma_2/\sqrt{n}$, for $\sigma_1$ and $\sigma_2$ respectively.

It follows accordingly that the standard deviations of the errors in the means are

$$\Sigma_{h_1} = \sigma_1/\sqrt{n}, \qquad \Sigma_{h_2} = \sigma_2/\sqrt{n} \qquad \text{(xii.)},$$

and the correlation of the errors made in the means is

$$R_{h_1 h_2} = r_{12} \qquad \text{(xiii.)}.$$

Further, the standard deviations for errors in $h_1$ when the error in $h_2$ is known, or
for errors in $h_2$ when the error in $h_1$ is known, are respectively

$$\frac{\sigma_1}{\sqrt{n}} \sqrt{1 - r_{12}^2}, \qquad \frac{\sigma_2}{\sqrt{n}} \sqrt{1 - r_{12}^2} \qquad \text{(xiv.)}.$$

The results (xii.) are, of course, well known; the results (xiii.) and (xiv.) are, we
believe, novel and important. An illustration may be of service.

Suppose the stature and arm-length of a population to be under consideration.
Let us suppose the mean stature of the population known from a great number of
observations. Now let the arm-lengths be determined for a random selection of the
population; then, if the stature of these individuals so selected differs $\Delta h_1$ in excess
from the mean stature of the whole population, the arm-length of the random
selection will most probably differ from that of the whole population by $\Delta h_2$, a
quantity fixed by (xiii.), or rather by the coefficient of regression $r_{12} \sigma_2/\sigma_1$.
Thus

$$\Delta h_2 = \frac{r_{12}\, \sigma_2}{\sigma_1}\, \Delta h_1,$$

with a probable error of $.67449 \times \dfrac{\sigma_2}{\sqrt{n}} \sqrt{1 - r_{12}^2}$.

In other words, if the arm-length is to be found from a selection of the general
population only, and the stature of this selection differs from that of the general
population, it is most reasonable to take the arm-length of the general population to
be the mean arm-length of the selected population less the quantity $(r_{12} \sigma_2/\sigma_1)\, \Delta h_1$.

Or, again, if a selection from a general population show a mean organ $\Delta h_1$ in excess
of that of the general population, the whole system of correlated organs will exhibit
changes of which the magnitudes are most probably given by the type $(r_{12} \sigma_2/\sigma_1)\, \Delta h_1$.
The bearing of this on what we have termed random evolution will be obvious.
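The stature and arm-length illustration can be put into numbers with a short sketch. All the figures below (standard deviations, correlation, size of the selection, and excess of stature) are hypothetical, and the function names are introduced only for this example.

```python
import math


def expected_shift_in_second_mean(d_h1, sigma_1, sigma_2, r_12):
    """Most probable excess of the second organ: (r12 * sigma2 / sigma1) * dh1."""
    return r_12 * sigma_2 / sigma_1 * d_h1


def probable_error_of_shift(sigma_2, r_12, n):
    """Probable error .67449 * (sigma2 / sqrt(n)) * sqrt(1 - r12^2)."""
    return 0.67449 * sigma_2 / math.sqrt(n) * math.sqrt(1.0 - r_12 ** 2)


# Hypothetical values, in inches: stature (organ 1) and arm-length (organ 2).
sigma_stature, sigma_arm, r_12, n = 2.5, 1.2, 0.8, 100
excess_stature = 0.5

shift = expected_shift_in_second_mean(excess_stature, sigma_stature, sigma_arm, r_12)
pe = probable_error_of_shift(sigma_arm, r_12, n)
print(round(shift, 3), round(pe, 4))
```

With these figures the mean arm-length of the general population would be taken as that of the selection less the computed shift, subject to the stated probable error.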

(β) Turning to the second part of the error correlation surface, we note at once
that, if two organs be correlated, random selection will give a system of correlated
deviations in their variations and their correlation.

Random selection (and a fortiori it may be added, artificial or natural selection),
which alters an organ's variability, alters the variability and correlation of all other
organs. In fact, when it is once realised how two random selections from a general
population will as a rule have organs of different means, of different variabilities, and
of different correlations, the means among themselves and the variabilities and
correlations among themselves forming systematic groups, it becomes obvious how any
assumption of the coefficient of correlation as a constant for local races runs wide of
the mark; and this, whether natural or random evolution, is to be looked upon as
the source of the observed differences in character. What chiefly concerns the
biologist in this matter at present is this, that even a random selection of one organ
will produce changes in all other organic characters, which, if small, are still sensible
and capable of quantitative expression.*

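(6) Returning now to the algebra of our investigation, we have to discuss the
second part of (xi.) by aid of the formulae given in (viii.), p. 237.

We require, in the first place, to evaluate the determinant $\Delta$.

Now

$$\Delta = \begin{vmatrix}
\dfrac{n (2 - r_{12}^2)}{\sigma_1^2 (1 - r_{12}^2)}, & -\dfrac{n r_{12}^2}{\sigma_1 \sigma_2 (1 - r_{12}^2)}, & -\dfrac{n r_{12}}{\sigma_1 (1 - r_{12}^2)} \\[2ex]
-\dfrac{n r_{12}^2}{\sigma_1 \sigma_2 (1 - r_{12}^2)}, & \dfrac{n (2 - r_{12}^2)}{\sigma_2^2 (1 - r_{12}^2)}, & -\dfrac{n r_{12}}{\sigma_2 (1 - r_{12}^2)} \\[2ex]
-\dfrac{n r_{12}}{\sigma_1 (1 - r_{12}^2)}, & -\dfrac{n r_{12}}{\sigma_2 (1 - r_{12}^2)}, & \dfrac{n (1 + r_{12}^2)}{(1 - r_{12}^2)^2}
\end{vmatrix}.$$

Divide the first row by $\dfrac{n}{\sigma_1 (1 - r_{12}^2)}$, the second row by $\dfrac{n}{\sigma_2 (1 - r_{12}^2)}$, the third row by
$\dfrac{n}{1 - r_{12}^2}$, the first column by $1/\sigma_1$, and the second by $1/\sigma_2$. Hence

$$\Delta = \frac{n^3}{\sigma_1^2 \sigma_2^2 (1 - r_{12}^2)^3} \begin{vmatrix}
2 - r_{12}^2, & -r_{12}^2, & -r_{12} \\
-r_{12}^2, & 2 - r_{12}^2, & -r_{12} \\
-r_{12}, & -r_{12}, & \dfrac{1 + r_{12}^2}{1 - r_{12}^2}
\end{vmatrix}.$$

Subtract the second column from the first, and then add the first row to the
second; we have

$$\Delta = \frac{n^3}{\sigma_1^2 \sigma_2^2 (1 - r_{12}^2)^3} \begin{vmatrix}
2, & -r_{12}^2, & -r_{12} \\
0, & 2 (1 - r_{12}^2), & -2 r_{12} \\
0, & -r_{12}, & \dfrac{1 + r_{12}^2}{1 - r_{12}^2}
\end{vmatrix}
= \frac{4 n^3}{\sigma_1^2 \sigma_2^2 (1 - r_{12}^2)^3}.$$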



The minors are now easily found to be

$$A_{11} = \frac{2 n^2}{\sigma_2^2 (1 - r_{12}^2)^3}, \qquad A_{22} = \frac{2 n^2}{\sigma_1^2 (1 - r_{12}^2)^3}, \qquad A_{33} = \frac{4 n^2}{\sigma_1^2 \sigma_2^2 (1 - r_{12}^2)},$$

$$A_{12} = \frac{2 n^2 r_{12}^2}{\sigma_1 \sigma_2 (1 - r_{12}^2)^3}, \qquad A_{13} = \frac{2 n^2 r_{12}}{\sigma_1 \sigma_2^2 (1 - r_{12}^2)^2}, \qquad A_{23} = \frac{2 n^2 r_{12}}{\sigma_1^2 \sigma_2 (1 - r_{12}^2)^2}.$$

* Take (xvii.) below, for example; it expresses for the first time quantitatively the important
biological principle that, if a group be selected at random from the general population, and it has more
variability in one character, it will be more variable than the general population in all other characters.



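From these expressions the values of the standard deviations and error correlations
are at once found by (viii.).

We have

$$\Sigma_{\sigma_1} = \frac{\sigma_1}{\sqrt{2n}}, \qquad \Sigma_{\sigma_2} = \frac{\sigma_2}{\sqrt{2n}} \qquad \text{(xv.)},$$

$$\Sigma_{r_{12}} = \frac{1 - r_{12}^2}{\sqrt{n}} \qquad \text{(xvi.)},$$

$$R_{\sigma_1 \sigma_2} = r_{12}^2 \qquad \text{(xvii.)},$$

$$R_{\sigma_1 r_{12}} = R_{\sigma_2 r_{12}} = \frac{r_{12}}{\sqrt{2}} \qquad \text{(xviii.)}.$$

The result (xv.) is, of course, old; the results (xvi.) to (xviii.) are novel, and lead
to interesting conclusions, which are considered in the following paragraphs.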
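The results (xv.) to (xviii.) can be compared with a random-selection experiment. The following is only a simulation sketch: many groups of $n$ pairs are drawn from a normal correlation surface with unit standard deviations, the constants of each group are computed, and the spread and correlation of their errors are set beside the formulae. The group size, number of groups, seed, and function names are arbitrary choices.

```python
import math
import random


def standard_deviation(values):
    m = sum(values) / len(values)
    return math.sqrt(sum((v - m) ** 2 for v in values) / len(values))


def pearson_r(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def constants_of_group(rng, n, r):
    """Draw n pairs with correlation r; return the group's (s1, s2, r12)."""
    xs, ys = [], []
    for _ in range(n):
        u, v = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        xs.append(u)
        ys.append(r * u + math.sqrt(1.0 - r * r) * v)
    return standard_deviation(xs), standard_deviation(ys), pearson_r(xs, ys)


def simulate(n=200, r=0.6, groups=1500, seed=1897):
    rng = random.Random(seed)
    s1s, s2s, rs = zip(*(constants_of_group(rng, n, r) for _ in range(groups)))
    return {
        "sd_of_s1": standard_deviation(s1s),   # compare with (xv.)
        "sd_of_r": standard_deviation(rs),     # compare with (xvi.)
        "corr_s1_s2": pearson_r(s1s, s2s),     # compare with (xvii.)
        "corr_s1_r": pearson_r(s1s, rs),       # compare with (xviii.)
    }


n, r = 200, 0.6
observed = simulate(n=n, r=r)
theory = {
    "sd_of_s1": 1.0 / math.sqrt(2 * n),
    "sd_of_r": (1.0 - r * r) / math.sqrt(n),
    "corr_s1_s2": r * r,
    "corr_s1_r": r / math.sqrt(2.0),
}
for key in theory:
    print(key, round(observed[key], 3), round(theory[key], 3))
```

Agreement can only be approximate, both because the number of groups is finite and because the formulae neglect the cubic and higher terms.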

(α) The probable error of a coefficient of correlation $r_{12}$ is $.67449\,(1 - r_{12}^2)/\sqrt{n}$. If,
therefore, the correlation between two organs is less than once to twice $.67449/\sqrt{n}$,
they cannot be safely assumed to be correlated at all.

(β) If we know definitely the errors made in $\sigma_1$ and $\sigma_2$ (if, for example, we know
the variations accurately) then the probable error of $r_{12}$ is that of an array of $r_{12}$
for definite $\Delta\sigma_1$ and $\Delta\sigma_2$. It is given at once by the coefficient of $(\Delta r_{12})^2$ in (xi.) as

$$.67449\,(1 - r_{12}^2)/\sqrt{n (1 + r_{12}^2)} \qquad \text{(xix.)}.$$

This is the value given for the probable error of $r_{12}$ by one of the present authors
in a former paper.* At that time he had not fully realised the importance of the
principle of the correlation of errors made in determining the magnitude of
frequency constants.

The following table will enable the reader to appreciate the difference in magnitude
between the "absolute" and "partial" probable errors of $r_{12}$:

    r_12     1 - r_12^2     (1 - r_12^2)/sqrt(1 + r_12^2)

    0        1              1
    .1       .99            .985
    .2       .96            .94
    .3       .91            .87
    .4       .84            .78
    .5       .75            .67
    .6       .64            .55
    .7       .51            .42
    .8       .36            .28
    .9       .19            .14
    1        0              0

* "Heredity, Regression, and Panmixia," 'Phil. Trans.,' A, vol. 187, p. 266.



It will thus be seen that when $r_{12}$ is small the absolute and partial probable errors
are both large and nearly equal; that when $r_{12}$ is large the absolute and partial
probable errors differ more widely, but, as both are small in this case, their difference
is not of much importance. Bearing these facts in mind, it will be found that the
reasoning based on the partial error in previous memoirs remains valid, even if the
partial error be replaced, as it generally should be, by the somewhat larger absolute
error.*
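The comparison of the "absolute" and "partial" probable errors of $r_{12}$ can be tabulated directly from (xvi.) and (xix.). The following is a minimal sketch; the function names and the choice n = 100 in the example are assumptions made for illustration.

```python
import math

PE = 0.67449  # factor turning a standard deviation into a probable error

def absolute_pe(r, n):
    """Probable error of r12 from (xvi.): .67449 (1 - r^2) / sqrt(n)."""
    return PE * (1 - r * r) / math.sqrt(n)

def partial_pe(r, n):
    """Probable error of r12 from (xix.): .67449 (1 - r^2) / sqrt(n (1 + r^2))."""
    return PE * (1 - r * r) / math.sqrt(n * (1 + r * r))

if __name__ == "__main__":
    n = 100  # illustrative sample size
    print(" r12   absolute   partial")
    for k in range(11):
        r = k / 10
        print(f"{r:4.1f}   {absolute_pe(r, n):.4f}     {partial_pe(r, n):.4f}")
```

The partial error never exceeds the absolute error, and the ratio of the two is 1/√(1 + r12²), which falls from 1 at r12 = 0 to about .71 at r12 = 1.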

(γ) If we know definitely the variability of any organ, and we take a definite
group of the general population to find the variability of a second correlated organ,
then there will be correlation between the deviation of the variation of this group
with regard to the first organ from the variation of the first organ in the general
population and the deviation of the variation of the group with regard to its second
organ from that of the general population's variation for the second organ. This
correlation is measured by $r_{12}^2$, and thus, if the organs be slightly correlated, it is
small; but if the organs be closely correlated, it is large. Suppose, for example, we
know the variability of the tibia, and require to find that of the ulna from a comparatively
few specimens. Let $\sigma_1 + \Delta\sigma_1$ be the variability of the tibia in the
specimens for which the ulna can be measured, and $r_{12}$ the correlation observed
between the ulna and tibia in these specimens; then, the variability of the ulna being
observed as $\sigma_2' = \sigma_2 + \Delta\sigma_2$, the most probable variability of the ulna in the general
population is

\[ \sigma_2' - \Delta\sigma_2, \quad \text{or} \quad \sigma_2' - r_{12}^2\,\frac{\sigma_2}{\sigma_1}\,\Delta\sigma_1, \]

or, since the second term is small, we may write $\sigma_2'$ for $\sigma_2$, and the above expression
equals

\[ \sigma_2' \left( 1 - r_{12}^2\,\frac{\Delta\sigma_1}{\sigma_1} \right). \]

For the long bones, $r_{12} = .9$ roughly, and therefore we have the ratio of variability of
the ulna in the general population to the variability observed in the group
$= 1 - .8\,\Delta\sigma_1/\sigma_1$.

It is clear that this expression also measures the change in the variability of the
ulna due to a random selection of tibia.
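As a worked instance of the tibia and ulna correction, the sketch below applies the factor 1 − r12² Δσ1/σ1 to an observed variability. The function name and the numbers in the example are hypothetical and are not taken from any measured series.

```python
def population_variability(sigma2_observed, r12, rel_dev_sigma1):
    """Most probable population variability of the second organ.

    sigma2_observed : variability of organ 2 observed in the group
    r12             : correlation of the two organs
    rel_dev_sigma1  : (group variability of organ 1 - population value) / population value
    """
    return sigma2_observed * (1 - r12 ** 2 * rel_dev_sigma1)

if __name__ == "__main__":
    # Hypothetical case: the group is 10 per cent. more variable in the tibia.
    print(population_variability(1.0, 0.9, 0.10))
```

For r12 = .9 and a group 10 per cent. more variable in the tibia, the ulna variability observed in the group would be reduced by about 8 per cent. to estimate that of the general population.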

* See, for example, the reasoning as to the non-constancy of the correlation coefficient for local races
in 'Phil. Trans.,' A, vol. 187, pp. 267 and 378.

(δ) Although the correlation between deviations in the variability of two organs from
their mean variabilities only varies as the square of their correlation, the correlation
between the deviations in the variability of an organ and in its correlation with a second
organ varies as the first power of the correlation of the two organs. In other words,
while a selection of variability may produce only a small or moderate change on the
variability of correlated organs, a selection of correlation or a selection of variability
is likely to produce considerable changes on variability and correlation respectively.
Let $\sigma_1$, $\sigma_2$, $r_{12}$ be the mean values of the standard deviations and the coefficient of
correlation for any two organs; let $\sigma_1 + \Delta\sigma_1$, $\sigma_2 + \Delta\sigma_2$, $r_{12} + \Delta r_{12}$ be the like
quantities for a group selected at random. Then the principle of regression tells us
that most probably

\[ \Delta\sigma_1 = R_{\sigma_1 r_{12}}\,\frac{\Sigma_{\sigma_1}}{\Sigma_{r_{12}}}\,\Delta r_{12}, \qquad \Delta r_{12} = R_{\sigma_1 r_{12}}\,\frac{\Sigma_{r_{12}}}{\Sigma_{\sigma_1}}\,\Delta\sigma_1. \]

Substituting the values given by (xv.) to (xviii.), we find

\[ \Delta\sigma_1 = \frac{1}{2}\,\frac{\sigma_1 r_{12}}{1 - r_{12}^2}\,\Delta r_{12}, \qquad \Delta r_{12} = r_{12}\,(1 - r_{12}^2)\,\frac{\Delta\sigma_1}{\sigma_1}. \]

Now these equations lead us to some important conclusions. In the first place, if
the correlation be very small or very large, then a random selection of variability ($\Delta\sigma_1$)
makes only a small change in correlation ($\Delta r_{12}$). The change in correlation for a
selection of variability is greatest when $r_{12} = 1/\sqrt{3}$, and then is approximately
$.385\,\Delta\sigma_1/\sigma_1$, or over 6 per cent., if $\Delta\sigma_1/\sigma_1$ were as high as 1/6. On the other hand, the
change in variability ($\Delta\sigma_1$) due to a selection ($\Delta r_{12}$) of correlation is small if the
correlation be small, but increases rapidly if the correlation become nearly perfect.
Of course, for perfect correlation the probable error of $r_{12}$ is zero, and accordingly it
is infinitely improbable that a selection can be made with $\Delta r_{12}$ differing from zero.
But if $r_{12}$ be not unity, then a selection in which $\Delta r_{12}$ is large, however improbable,
will give very large changes in the variability, if $r_{12}$ be very large. Our conclusion is
accordingly that considerable changes in variability are likely to be produced whenever
there is a correlation selection among highly correlated organs.
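The two regression relations of this paragraph are easily explored numerically. The sketch below assumes the forms Δr12 = r12 (1 − r12²) Δσ1/σ1 and Δσ1/σ1 = r12 Δr12 / {2 (1 − r12²)}, obtained by substituting (xv.) to (xviii.) in the regression equations; the function names are illustrative.

```python
import math

def corr_change_per_unit(r):
    """Change in r12 per unit of relative selection of variability, r (1 - r^2)."""
    return r * (1 - r * r)

def variability_change_per_unit(r):
    """Relative change in sigma1 per unit of selected change in r12, r / (2 (1 - r^2))."""
    return r / (2 * (1 - r * r))

# Coarse grid search for the correlation at which the first effect is greatest.
best_r = max((k / 1000 for k in range(1001)), key=corr_change_per_unit)

if __name__ == "__main__":
    print(best_r, corr_change_per_unit(1 / math.sqrt(3)))
    for r in (0.1, 0.5, 0.9, 0.99):
        print(r, variability_change_per_unit(r))
```

The grid maximum falls at about .577, that is 1/√3, where the coefficient is about .385; the second coefficient grows without limit as r12 approaches unity, which is the sense of the concluding remark above.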

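(7) To find the Probable Error of the Regression Coefficient for Two Organs.
The regression coefficient $\rho_1$ is given by

\[ \rho_1 = r_{12}\,\frac{\sigma_1}{\sigma_2}. \]

Its standard deviation $\Sigma_{\rho_1}$ is given by the summation equation

\[ \Sigma_{\rho_1}^2 = S\,(\Delta\rho_1)^2 / n. \]

To find its value we adopt a method, which we give on this first occasion at
length, as it will be frequently used in the sequel.

Take logarithmic differentials

\[ \frac{\Delta\rho_1}{\rho_1} = \frac{\Delta r_{12}}{r_{12}} + \frac{\Delta\sigma_1}{\sigma_1} - \frac{\Delta\sigma_2}{\sigma_2}. \]

Square and divide by n after summing

\[ \frac{S(\Delta\rho_1)^2}{n\rho_1^2} = \frac{S(\Delta r_{12})^2}{n r_{12}^2} + \frac{S(\Delta\sigma_1)^2}{n\sigma_1^2} + \frac{S(\Delta\sigma_2)^2}{n\sigma_2^2} + \frac{2S(\Delta r_{12}\,\Delta\sigma_1)}{n r_{12}\sigma_1} - \frac{2S(\Delta r_{12}\,\Delta\sigma_2)}{n r_{12}\sigma_2} - \frac{2S(\Delta\sigma_1\,\Delta\sigma_2)}{n\sigma_1\sigma_2}. \]

Now, remembering the definitions of standard deviations and coefficients of
correlation, this may be written

\[ \frac{\Sigma_{\rho_1}^2}{\rho_1^2} = \frac{\Sigma_{r_{12}}^2}{r_{12}^2} + \frac{\Sigma_{\sigma_1}^2}{\sigma_1^2} + \frac{\Sigma_{\sigma_2}^2}{\sigma_2^2} - \frac{2\,\Sigma_{\sigma_1}\Sigma_{\sigma_2} R_{\sigma_1\sigma_2}}{\sigma_1\sigma_2} + \frac{2\,\Sigma_{r_{12}}\Sigma_{\sigma_1} R_{\sigma_1 r_{12}}}{r_{12}\sigma_1} - \frac{2\,\Sigma_{r_{12}}\Sigma_{\sigma_2} R_{\sigma_2 r_{12}}}{r_{12}\sigma_2}. \]

Now all the quantities on the right have already been found in equations
(xv.) to (xviii.). Hence, substituting, we have

\[ \frac{\Sigma_{\rho_1}^2}{\rho_1^2} = \frac{1 - r_{12}^2}{n\,r_{12}^2}. \]

Hence

\[ \Sigma_{\rho_1} = \frac{\sigma_1}{\sigma_2}\,\frac{\sqrt{1 - r_{12}^2}}{\sqrt{n}} \qquad \text{(xx.)}. \]

Thus the probable error of a regression coefficient

\[ = .67449\,\frac{\sigma_1}{\sigma_2}\,\sqrt{\frac{1 - r_{12}^2}{n}}. \]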

This is of fundamental importance for testing the significance of results obtained
by applying the theory of regression to problems in heredity, panmixia, &c.

The probable percentage error in a regression coefficient $= 67.449\,\sqrt{1 - r_{12}^2}\,/\,(r_{12}\sqrt{n})$, and
hence is small if the correlation be close, and increases rapidly if the correlation be
small. This again illustrates the point to which reference has been made in another
memoir,* namely, that when only a few individuals can be measured, the most
reliable results for the purposes of the quantitative theory of evolution are to be
found from the measurements of the most highly correlated organs.

* Pearson and Lee: "Correlation in Civilised and Uncivilised Races," 'Roy. Soc. Proc.,' vol. 61,
p. 345.
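The reduction of the squared logarithmic differential can be retraced term by term. The sketch below assumes the values (xv.) to (xviii.) for the standard deviations and error correlations, sums the six terms, and compares the total with the closed form (1 − r12²)/(n r12²); the helper names are illustrative.

```python
import math

PE = 0.67449

def relative_variance_by_terms(r, n):
    """Sum of the six terms of (Sigma_rho / rho)^2, using (xv.) to (xviii.)."""
    rel_r = (1 - r * r) / (r * math.sqrt(n))   # Sigma_r / r12
    rel_s = 1 / math.sqrt(2 * n)               # Sigma_sigma / sigma, either organ
    R_ss = r * r                               # error correlation of sigma1, sigma2
    R_sr = r / math.sqrt(2)                    # error correlation of sigma, r12
    return (rel_r ** 2 + rel_s ** 2 + rel_s ** 2
            - 2 * R_ss * rel_s * rel_s
            + 2 * R_sr * rel_r * rel_s
            - 2 * R_sr * rel_r * rel_s)

def regression_pe(s1, s2, r, n):
    """Probable error of the regression coefficient r12 * s1 / s2."""
    return PE * (s1 / s2) * math.sqrt((1 - r * r) / n)

def regression_pct_pe(r, n):
    """Probable percentage error of the regression coefficient."""
    return 100 * PE * math.sqrt(1 - r * r) / (r * math.sqrt(n))

if __name__ == "__main__":
    for r in (0.2, 0.5, 0.9):
        print(r, relative_variance_by_terms(r, 100), regression_pct_pe(r, 100))
```

The two cross terms in σ1 and σ2 with r12 cancel, which is why the result depends on the variabilities only through the ratio σ1/σ2.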




Attention should be drawn to the fact that we have replaced errors by differentials.
This is only legitimate so long as product terms in the errors are negligible as compared
with linear terms. This is the assumption almost universally made by writers
on the theory of errors.* It will not lead us astray, so long as we take care in any
practical applications to verify the smallness of $\Delta r_{12}$, $\Delta\sigma_1$, $\Delta\sigma_2$, as compared with
$r_{12}$, $\sigma_1$, and $\sigma_2$ respectively.

(8) To find the Probable Errors and Error Correlations of the Constants of a
Normal Frequency Distribution for Three Organs or more.

It will scarcely have escaped the attentive reader that our investigation hitherto,
only involving two organs, has left several important problems untouched. For
example, it has dealt only with the direct effect of random selection. But we may
ask such a question as this: What is the change in the correlation of two organs
when the variability of a third is randomly selected? Or again: What is the
change in the correlation of two organs when the correlation between one of these
and a third, or between a third and a fourth, is randomly selected? All these are
important problems in the theory of evolution.

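The general equation to a normal frequency-surface for m organs is:

\[ z = \frac{n}{(2\pi)^{m/2}\,\sigma_1\sigma_2\cdots\sigma_m\sqrt{R}}\;\text{expt.}\left\{-\frac{1}{2R}\left(R_{11}\frac{x_1^2}{\sigma_1^2} + R_{22}\frac{x_2^2}{\sigma_2^2} + \ldots + 2R_{12}\frac{x_1x_2}{\sigma_1\sigma_2} + \ldots\right)\right\} \qquad \text{(xxi.)}, \]

where R is the determinant

\[ R = \begin{vmatrix} 1 & r_{12} & r_{13} & \ldots & r_{1m} \\ r_{21} & 1 & r_{23} & \ldots & r_{2m} \\ \vdots & \vdots & \vdots & & \vdots \\ r_{m1} & r_{m2} & r_{m3} & \ldots & 1 \end{vmatrix}, \]

and $R_{ss'}$ is the minor of the term in the $s$th row and $s'$th column.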
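* Gauss, admittedly, 'Theoria Combinationis Observationum' .... p. 53, Problema; Laplace and
Poisson, actually but obscurely; see 'Théorie analytique des Probabilités,' Liv. II., chap. IV., and
'Recherches sur la Probabilité des Jugements,' chap. IV.; more clearly in Todhunter's account,
'History of Theory of Probability,' Art. 1,002 et seq. Further, Crofton, Article 'Probability,' § 48,
for a like assumption.

We require first to find the quantities like (vii.) of our Art. (4),

\[ \log z = \log\{n/(2\pi)^{m/2}\} - \tfrac{1}{2}\log R - S_1(\log\sigma_s) - \tfrac{1}{2}S_1\!\left(\frac{R_{ss}}{R}\frac{x_s^2}{\sigma_s^2}\right) - S_2\!\left(\frac{R_{ss'}}{R}\frac{x_s x_{s'}}{\sigma_s\sigma_{s'}}\right), \]

where $S_1$ denotes a sum for all values of $s$ and $S_2$ a sum for all pairs $s$, $s'$. Hence

\[ \frac{d(\log z)}{d\sigma_1} = -\frac{1}{\sigma_1} + \frac{R_{11}}{R}\frac{x_1^2}{\sigma_1^3} + S\!\left(\frac{R_{1s}}{R}\frac{x_1 x_s}{\sigma_1^2\sigma_s}\right) \qquad \text{(xxii.)}, \]

\[ \frac{d(\log z)}{dr_{12}} = -\frac{1}{2}\frac{d(\log R)}{dr_{12}} - \frac{1}{2}S_1\!\left\{\frac{x_s^2}{\sigma_s^2}\frac{d}{dr_{12}}\!\left(\frac{R_{ss}}{R}\right)\right\} - S_2\!\left\{\frac{x_s x_{s'}}{\sigma_s\sigma_{s'}}\frac{d}{dr_{12}}\!\left(\frac{R_{ss'}}{R}\right)\right\} \qquad \text{(xxiii.)}. \]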

Differentiating the first of these again with regard to $\sigma_1$ and summing for all
possible values of the x's, we find



ci 



11 



I j . . . . ^^ ax, ax, . . . ax,, ^ ^^ 1 1 ^^ .^^ i 



But 



Hence 



R = Rn + S,(R,,r4 



n /E + E 






11 



(xxiv.). 



Differentiating (xxii.) with regard to $\sigma_2$ and summing, we have at once



. . . ^' -y----— da;| dx2 . . . <i^,, = ai2 = — - - -^T^' . . . (xxv.). 

Differentiating (xxii.) with regard to $r_{12}$ and summing, we have

J . J • - '^^ da, dr^ '^""^ "^^^ * • ^ "^^^^ "- ^'^'^ "^ ., 1 r^r,, \ E/ + ^^"^^ ^"^ r^r,, \ E / 

cTj U?r,o\ E / E J ' 

or 

A2 = — '?iRi2/0'iIi (xxvi.). 

Differentiating (xxii.) with regard to $r_{23}$ and summing, we find

_ '^^^ ^^ K,i , n ^^ Q.A'»^-^'A 

_ r^. f^ /E„ 4- S, (Ej//\,)^ 



0*1 «?'2,> \ E 

or 

l6^23 = (xxvii.). 

Our next step is to differentiate (xxiii.) with regard to $r_{12}$ and sum. We have



cl? (log z) 

a/ 12 



LitAj\ . . » CtJCyi^ -«— •'~~* 12'^ 12 J 






%__ cP (log E) 



2 r^ 



^2. d' 






d /EiA 



. T~ S,, ":^' I 4" 2n -r— -Iff ) — nB,,> , ,, 






/ / 



Now, since R =: En + S, (Ilj,r,,). 



and therefore 



Wlxv ^=: b^,^ (jA'.^.,,) "4*" 2b.^,,/ (l\,s' '^',,§7; 



k5s,^/ (-tv.^..^./ Tg^ij Ixj -— t|- W' "-"-' -9 b,>>.<> I^M'^^^,/ -ti-je 



Substituting, we find 



hio 



r^'-'i 



n d^ (log E) j^ /E,2\ 

2 d.7% "^ ^^r,. \ E / 



J1 



But ™^(logR) 



1 (f(>E -, -j^ d.lx , 

z~" - - and 2iiio = 7^ ; hence 



Thus finally 



^^^ (log K) _ o '^ ( Rh/K) 



.2 
'12 






12 



.1 wt X w 



^i /E-iaX 2Ef2 — EE|.2 12 



X 



xviii.). 



where $R_{12,12}$ is the second minor, found by striking out from R the first and second
rows and columns.

In the next place let us differentiate (xxiii.) with regard to $r_{13}$ and sum. We find
in precisely similar manner



♦ • • iV 






or 



Similarly, we deduce



h 



12^^34 



m 






^t^ /Ev: 



n 



r^7' ' 



12 \ •■ 



^18 

E 



? 01 ^ 



l;!?>i; 



n 






i'j I XVm ' 






71 -T- 



i * ^ 






n 






n 






s « & 



* « I a2v^« / 9 



This completes all the possible types. 



We are now in a position to write down from (vii.) of Art. (4) the complete
frequency surface for errors in the constants of a normal frequency surface for
m organs. It will suffice to write a type of each term. We have

P^ == Po X exponent — ^ i - - -^-^ ~-~~-- + ~^^ ~-~^ — - 



crj Xt ^i<5"2 



97>a pp 9P A^ 
I I -^-^Ha """ J^^-»^h2, 12 / * „ \2 I I ^iil2 i^O-jL . 

^ . _ ^ ^ - ^/^) j^^ -J« _ , _| ,^- -- ^^.^^ + . . . 

+ 2 ^-^^^-^^^ ^--^" dn^ Ari3 + . . . + 2 ^' '^'^ ^-'^ Ari2 An4 + . . . K^xxi.). 

Now this result again seems at once to give conclusions of considerable importance.
Thus:

(α) Since there are no terms of the type $\Delta\sigma_1\,\Delta r_{23}$, we infer (i.) that the random
selection of variation in one organ will most likely only vary the correlation between
two other organs by terms of the second order; and that (ii.) the random selection of
correlation between two organs will in all probability only change the variability of
a third organ by terms of the second order.

(β) The selection of correlation between any two organs will most probably vary
the correlation between a second pair, i.e., terms exist in $\Delta r_{12}\,\Delta r_{34}$, &c.

(γ) The selection of the variation for any organ varies the correlation between
that organ and a third organ, and vice versâ the selection of correlation between two
organs changes the variability of both organs. And lastly

(δ) The selection of correlation between two organs varies the correlation between
either organ and any other organs.

We may exhibit these results more clearly by taking four special organs, say,
femur, tibia, humerus, and radius. Then a group having the variability of its femur
different from that of the general population, will also have, in all probability, the
variability in its tibia, humerus, and radius different; the correlations femur-tibia,
femur-humerus, and femur-radius different; but those of tibia-humerus, humerus-radius,
and radius-tibia only slightly different. Further, a group having the
correlation of its femur-tibia different from that of the general population, will also
have all the other correlations, humerus-radius, femur-humerus, femur-radius,
tibia-humerus, tibia-radius, different from the values for the general population. Further,
the variability in femur and tibia will be changed; but in all likelihood the variability
in humerus and radius only slightly changed.

These general conclusions, which seem to cast considerable light on the manner
in which selection influences the variability and correlation of organs, must now be
reduced to quantitative expression.

[Page 250 of the original is occupied by the determinant Δ of the tenth order for four organs, equation (xxxii.), printed sideways. Its rows and columns refer in order to σ1, σ2, σ3, σ4, r12, r13, r14, r23, r24, r34; the individual entries are not recoverable from this copy.]

Divide each row by n, the first, second, third, and fourth rows by $\sigma_1$, $\sigma_2$, $\sigma_3$, $\sigma_4$
respectively, and the first, second, third, and fourth columns by $\sigma_1$, $\sigma_2$, $\sigma_3$, $\sigma_4$
respectively. Multiply the fifth column by $r_{12}$, and subtract it from the first and
second; the sixth column by $r_{13}$ and subtract it from the first and third; the seventh
column by $r_{14}$ and subtract it from the first and fourth; the eighth column by $r_{23}$
and subtract it from the second and third; the ninth column by $r_{24}$ and subtract it
from the second and fourth; the tenth column by $r_{34}$ and subtract it from the third
and fourth. Remembering that

\[ R = R_{11} + r_{12}R_{12} + r_{13}R_{13} + r_{14}R_{14}, \]

\[ R = R_{22} + r_{21}R_{12} + r_{23}R_{23} + r_{24}R_{24}, \ \text{\&c.}, \]

and that 

E + ''\lr,X^)^ ''dr,X^) + ' '' dr,X'& 

(I /Eis\ , cl /E.,,\ , cl /'E3A d /E3s\ „ 

f<ri2 \^ J dryi \ E / ' dr-y,^ \ E / di\i \ E / ' 

we find, writing $d_{12}$ for $d/dr_{12}$, &c., for brevity,



A 



#» 



crfo-|<r|<7-| 



X 



5 



5 



2K„/R, 0, 0, 0, Eis/R, E,3/R, Ru/R, 0, 0, . 0, 

0, 2R22/R, 0, 0, Ri^/R, 0, 0, R,3/R, R24/R, 0^ 

0, 0, 2R33/R, 0, 0, Ria/R, 0, R23/R, 0, R3,/R, 

0, 0, 0, 2R«/R, 0, 0, R,,/R, 0, R,,/R, ^^JR, 

d.m dm dm dm dm dm dm dm dm d.^^ 



i " 



<'.-(i).<i)'*.<i).'^..(i>*{i)*{iv..(i*).^'»(i),<'..(i).*#' 



'31 



''.<l)*.(t).<'<l).<'.<l).<'.<|><^»(|").<'.{|)^'.<|).<'..(|).rf..(|, 

■''<l).'^<l).''»(li.<l)*4l)*<|)^..(|)<4.(|).<i»(|j.'4.(| 

</,.(|)</„(|),*{|),<J„(|),d.(|)<.(|),c4(|)..(|),c^|) 

*'(l)' 'Hi.)' ''-(¥)■ *'(lj' ''"(I)' **(t)' *"(I) '-'"(^)- *•( t). *<(!")■ 

2 K a 






Here the signs of the terms in the last six rows have been changed from minus to
plus.* Now multiply the first four rows by $R_{12}/R$ and add them to the fifth row, the
first four rows by $R_{13}/R$ and add them to the sixth row, the first four rows by $R_{14}/R$
and add them to the seventh row, and so on. Then, remembering that



'12 



^^12 



cl 



12 



l^j: 



(L. ^ 



E 



'E 



22 



It 



E 



'34 



E 



~5jlXi'|x('|2 it. Li 



Jl\> 






E dr 



n 



12 



91? Pl 1 /7T? 



E^ 



E clr 



B 






v/t « C-X/\J s ) 



and, further, that $R_{pp}$ does not contain $r_{pq}$, so that $d(R_{pp})/dr_{pq} = 0$, we have, taking
a factor $1/R$ out of each row,



(_1)%M 1 


2:r„, 


0, 


0, 


0, 


-"'i2) 


Rl3, 


J^JH9 


0, 


0, 


0, 




0, 


2 0.225 


0, 


0, 


Rr., 


0, 


0, 


-r^23i 


li.74, 


0, 


i 


0, 


0, 


2XV335 


0, 


0, 


1^135 


0, 


•1^23? 


0, 


Rmj 


, 


0, 


0, 


0, 


2ii4t, 


0, 


0, 


J^14j 


0, 


ii243 


.1:1/345 




A 


A 


aivgg J 


rfR-M 


dlli2 


f?Ei3 


fffil4 


r/lbgg 


ttE24 


^P^M 



0, 
0, 

dru 
clEu 



0, 
0, 



Cvf 94 (a/i k 



clf 



12 



0, 



ch 



'83 



11- 



0, 



CZEq 



00 



0, 






<^r 



12 



ir 



12 



rZE,9 cZE 



0. 



;i2 
LiyXVio 



43 



^r^g cl^ 



C?f 



14 






0, 



(^r, 



23 



wXli 



12 



f^r, 



24 



84 



ttK|2 



r/E 

CtXtig 



c^E 



ClfjAliA 



- I 



df 



12 



34 



dr-u d'}\ 



34 



34 



dn^ 

CvXVoo 

Cii'i l>g 

^Egg 

Ct'Xl'k)':} 

dTu 



m\2, 
dru 

tt'XVoA 

dvu 

tt'XX'24 



(^'r« 



- js 



24 



fM 



^24 






tt'XV': 



'34 



C6E31 



ei^ 



28 



tt^XX^.f ChlXj^ 



'84 



CZ?', 



94 



ctXvfij, 66 Xi" 



84 



r^f 



84 



84 



/ "v--y "^ 111 I 



The form of Δ is now clear for the case of any number of correlated organs.†

(10) Case (i.). Let us evaluate Δ and its minors for the case of three correlated
organs.

* The factor ( — i^^i^d^"'^^ must be introduced if we deal with j9 organs. 

† We may reduce this determinant as follows to one of the 6th order. Divide the 1st, 2nd, 3rd, 4th,
5th, 6th, 7th, 8th, 9th, 10th columns by $2R_{11}$, $2R_{22}$, $2R_{33}$, $2R_{44}$, $R_{12}$, $R_{13}$, $R_{14}$, $R_{23}$, $R_{24}$, $R_{34}$ respectively.
After this division, subtract the first column from the 5th, 6th, and 7th. The determinant reduces to
one of the 9th order: subtract the first column of this new determinant from its 4th, 7th, and 8th
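In this case $R = 1 - r_{23}^2 - r_{13}^2 - r_{12}^2 + 2r_{23}r_{31}r_{12}$, and we have

\[ R_{11} = 1 - r_{23}^2, \qquad R_{22} = 1 - r_{13}^2, \qquad R_{33} = 1 - r_{12}^2, \]

\[ R_{23} = r_{31}r_{12} - r_{23}, \qquad R_{31} = r_{12}r_{23} - r_{13}, \qquad R_{12} = r_{23}r_{13} - r_{12}, \]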
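The determinant R and its minors for three organs can be verified mechanically. The sketch below assumes that "minor" is used here in the signed sense (what is now called the cofactor), which is what the closed forms above imply; the function name is illustrative.

```python
def cofactors3(r12, r13, r23):
    """Return (R, C), where C[i][j] is the signed minor R_{i+1, j+1} of the
    3x3 correlation determinant R."""
    m = [[1.0, r12, r13],
         [r12, 1.0, r23],
         [r13, r23, 1.0]]

    def cof(i, j):
        rows = [k for k in range(3) if k != i]
        cols = [k for k in range(3) if k != j]
        minor = (m[rows[0]][cols[0]] * m[rows[1]][cols[1]]
                 - m[rows[0]][cols[1]] * m[rows[1]][cols[0]])
        return (-1) ** (i + j) * minor

    C = [[cof(i, j) for j in range(3)] for i in range(3)]
    R = sum(m[0][j] * C[0][j] for j in range(3))  # expansion by the first row
    return R, C

if __name__ == "__main__":
    R, C = cofactors3(0.5, 0.3, 0.4)
    print(R)        # determinant of the correlation matrix
    print(C[0][1])  # R12 = r23 r13 - r12
```

The same expansion R = R11 + r12 R12 + r13 R13 is the identity used repeatedly in reducing Δ.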

whence we have 






•^(1 '''23/> 0, 0, ^W)3 ''l2) '''l2''^23 ■"" ''l3) ") 

0, 2(1— rfs), 0, r237-i3~n2, 0, nirn — r^i,^ 

^J ^j "^ ^^'12 ' ^y "^'235 "^^SlJ 

U, ^'13? ^^? ' 23> J- J '125 

Divide the first three columns by 2; multiply the last row by $r_{23}$ and subtract
from the first row; the fifth row by $r_{13}$ and subtract from the second; the fourth row
by $r_{12}$ and subtract from the third; we find



column, it again reduces by one degree. Repeat the process twice more, and after a slight rearrangement,
the fact that d (n^,)ldr^, = being remembered, we have, if f„„ == R,„/ v/RJI^, 

A — __!?__„ ^l\J^22^33^U Ti Ti T) Tf Tf Tf 



X 



i;.^"^^"' 4;'"^'^'" Jr3^°°^'"' •• - 



T^ log fi, , 



the general run of terms being obvious.

In precisely similar manner the value of Δ for p organs can be written down, its degree being p less
than the form given in (xxxiii.). We have not succeeded in reducing Δ for the general case [since
writing this, Mr. Arthur Berry, of King's College, Cambridge, has succeeded in reducing the determinant
for p = 4, and also in showing its relation to elliptic space], but we feel fairly confident that its



value will be found to be 



^14 . . ] trlW'-^-' 






\ 



8^/t«(-iy 1 



" 2 "2 
0-i<To(7.( 



E« 



1, 

0, 
0, 
0, 
0, 



r.,y. 



0, 



;>3 



0, 



0, 



;■ 



0, 



\o7 



0, 


— ''l-iJ 


'^"j3) 


'^'233 


0, 


— ''-ri, 


'l-is, 


■~" '-^23? 


I, 


rm 


ru, 


— ^'2S, 


■ Vl2, 


™l, 




^'ab 


0, 


'>n, 


-1, 


' 12? 


0, 


fu, 


' 12? 


-1. : 



Add the first column multiplied by Ty^ to the fourth ; add the first multiplied by 
Vy^ to the fifth ; and subtract the first multiplied by t'23 from the sixths the determinant 
will reduce to the minor of the first row and colunrn. Continuing this process twice 
more^ we ultimately deduce 



« I XX a1 V . )« 



8m-«(-1)" 1 


- Bai, 


"™" -^^12 3 


-Ri. 


8^6^^ 1 


cr'f,7i<rl BP 


"^ -1^12? 


"^ -^^221 


— B23 


~ (T\<Tlal R* 




T" xli3? 


"^ Xi23? 


™ II23 





We will now proceed to calculate such of the minors of Δ as will give us results
beyond those obtained for two correlated organs.*

We require the correlation of $\sigma_1$ and $r_{23}$, and of $r_{12}$ and $r_{13}$.

Taking $\sigma_1$ and $r_{23}$ we must strike out in (xxxii.), as we have taken only three
organs, the 4th, 7th, 9th, and 10th rows and columns straight off, and for the required
minor the 1st row and the 8th column. We have then for the minor $M(\sigma_1, r_{23})$,



M (oTi, r,,) 



OV' 



2 2 

J. i O 



"IT ' 



•^^12 

R 

"r 

0. 



It 






R 



K 



12 



E 



\x 



0, 



5^ 
R 



E ' 

E 4- E^s 

Xt 

0, 

R ' 

R ' 



I> 

E 



0. 



Co /111 9 

6? / E^2' 

ft /E|3 

tZ/'g.j \ E 



0, 



E 



lo 



E 



d 



m\^ 



d 

dr 



13 



d 



dr^^ 



E 
E 

''T> ^ 

E 



To reduce this expression take the third row multiplied by rjo from the first, and 
the fifth row multiplied by Voz from the first. Then take the fourth row multiplied 
by ri3 from the second, and the fifth row multiplied by ^3 from the second, then 



remembering that 



* As a matter of fact all the minors were worked out and the results of (xv.) to (xviii.) thus verified. 






and that generally, 



we find:




M {cTi, ^23) = 


rJ' 


" 99 



JAj — i\j22 ~r ^12 1^12 "T" '^"23Av2Mj 



fl yl.\,gi,pt I X\j) d 



di\. 



rp 



0, 



0, 



E ' 



E' 



0, 



2E, 



^22 






0, 



Xtj^ 12 



0, 

E ' 



'<Ji\'„' 



KM . 



0, 

Jtv, 



0. 



T). ? 



E 



> ^ 



d 



dr^ 
d 



R 



df 

A 

di 
d 



12 



12 



'R \ 

R / 

/ 

R/ 



'R 



12 



d 



12 



dr 



00 



E 



^j^ / It? 






df 
d 



13 



12 



^^22 



R 

p \ 

R / 

Rj3 



' r 



» \ 



E/ ' 

Tf/' 



,.-t. 



f??Vn \ R 



Multiply the first two columns by E12/II and subtract their sum from the fourth ; 
multiply by the second two columns by R13/R and subtract their sum from the fifth ; 
divide out by the factor l/Rl We obtain 



M ((Ti, ^23) 



n/' 



^-^ Rb 



(7, criers n" 



0, 2E,2, 0, 



0, 



-1)1 o. 



-t»^13j 



0, 



Now remembering that 

R = 1 






rti 



substitute the various terms, and we find : 



Uj -iuJlV33} 



Ri 



12? 



0, 



R 



^2 



«j7 



0, 



lii3, 



-A-V23> 



dM^^ 


rfRgg 


dn2 ' 


*^13 ' 


dF^BH 


e^Rgg 


dn^ ' 


^^3 ' 


d^.,. 


r?Ri3 


dTn ' 


^^2 ' 


ttRj2 


rfRjg 


dn^ ' 


^^3 ' 


rmi2 


cHlie 



dr. 



23 



/ 12 -r ^/23' 31' 12? 



? 



M (0-1,^23) 



4^-^ 1 



9 9 T > "1 



•^^^^23 



0, 


1 - H., 




. 0, 


0, 


0, 


0, 




1 - t\, 


— n 


'T' i.ii. /y* rv* 

/ 12 T^ '28' 13? 


""^ "^^12 "r '^^23'^'l3? 




0, 


— 1 


'^'iS + '^Y3'^'l2j 


0, 


— 


'^^13 + '^'l2'^''23? 


T'lt, 


0, 


"^ ^"23 + ^*r2'^'*i3? 


»« 


^^'23 + '?*r2'^^3? 


rn, 



12? 



0, 

^'23? 

-h 

rv>- 






Subtract rjg times the fifth from the secondj and r^o times the fourth from the third 
columB : 



M {(Ti, n^) = 



4?!'"'' 1 

cricr|cr| W 



0, 
0, 

' 13 n^ '23' 12 J 

0, 



1. 
0. 



r-i 



12? 



' 13j 



a-* 



a-* 



0, 

' 12? 



t' 



235 



0, 

^'23? 



13 







/ O'J 



r.o 



Add $r_{13}$ times the second column to the fifth and $r_{12}$ times the third to the fourth,
we have



M ((Ti, ^28) 



4?^^"^ 1 






4# 1 



cr^crla-l W 



0, 


h 


0, 


0. 





0, 


0, 


1, 


0, 





-tii2j 


«-« ^''125 


^^I2j 




-"23 


-ttis? 


'^*13j 


"- ri, 


b -^ R235 


— ^22 


0, 


— ^3. 


^ r2: 


>, — • Ji>m, 


~E,3 


tt'335 


1^23? 


R21 






J^28? 


-ti22? 


Ri. 






t^is? 


ii|25 










J~l ini {^13 (R23R.I3 -^ R'2lR33) + Rr2 (R21JR-23 — ' R13B22)} 



CTjCTgO-g 



B^ 



4'?# 1 



cr^<r|cr| R 



J { rVl3^12 ^ "T -tll2'^''l3 J^ 



4^/^'' 1 



9 9 "O i 



{rn {rn — ^■'i2n>3) + ^'is ('^'12 — ^^la^a) 



1 



s 9 a 



« IJ^XJv\»l» 



Now we have seen that 



Lath A^ff. Jbf ' 



'o-i'ga 



by (viii.) of Art. (4), 



o-i-^rga 



and further $\Sigma_{\sigma_1}$ and $\Sigma_{r_{23}}$ are given by (xv.) and (xvi.) of Art. (6). Hence






In the next place we will determine $R_{r_{12} r_{13}}$. The minor $M(r_{12}, r_{13})$ is given by



M (rj2, Ty,) - 



ov' 



2 2 *> 



Ib-f- It}} 

E ' 


E ' 


E ' 


E ' 


0, 


E ' 


It + L22 

E ' 


'?''2oiA2;j 

E ' 


0, 


lloo 

E ' 


'^^IP.l^l.'] 


'?''2.'{J^^2.'] 


E + E.j.; 


J'13 . 


-I4I! 


E ' 


E ' 


;r ' 


E ' 


E ' 


E ' 


Ilj2 

E ' 


0, 


d /E,:\ 

rfj',2 \ E / ' 


ft / 1^23 


0, 


1\.2«5 


E ' 


fZr,., \ E y ' 


A /R20 
f^rgo \ R 



n-" 



0-f(7l(7lR« 



^^2-1^12? -A-^ + -tl'225 '?''23^23> 

'^^131^13? ^''23tv23> l^'+R-JS? 



11] 2) 1^12? 



0. 



Ro 



3) 



0, 



-t^23> 



i^l3> 




0, 


0, 




1^'235 


Ri., 




1^23j 


2Ei3il.i2 

E 


7 00, 




2E13E2.) 

E 


n%, 


2Ri . 
It 



Add the first three rows together, multiply by $R_{12}/R$ and subtract from the fourth,
and by $R_{23}/R$ and subtract from the fifth; we find



M {ry>, Ti?) = 






cria"2(7r,rt^ 



R + Rii, 


^*12l^l2> 


^"la-t^iss 


Ri,,, 


0, 


^^2"Rl2. 


R -j- R22> 


^323, 


0, 


J^V23, 


^'l3-t^l3j 


'^"23 1*^23 > 


R + R33, 


Rl3? 


1-^23? 


•— liio. 


- R12, 


"~~" juXXi2^ 


"~" '^''233 


^'l3) 


-«- 2rio3, 


- R.,. 


1^23? 


' 12> 


L 



Multiply the fourth column by r^^, and the fifth by 7^23? and subtract from the third 
column 



M (n2, ria) = 



fV' 


R + Ru , 

ri2 tii2 , 


'^''121^12 > 

R + R22 J 


0, 
0, 


Rl3. 

0, 





o-fo-lo-pi' 


R23 




'^'^13 ^"^13 J 


^^'23 tv23 ; 


■^-1^83 > 


1-^13 ) 


-1^23 




— R12 ) 


1^12 ? 


2^12 , 


■"" ^^'23 > 


— T, 




2XV23 , 


— Ii23 5 


0, 


'^12 3 


1 



13 



Multiply the third row by Ty^, and the fourth by 1 — 7^12 or R33, and subtract the 
latter ; the determinant now i^educes to one of the fourth order, and we find : — 




M (rio, r,3) 



2iv^ 


R + T\„ ^ 


^^2^12 . 


•l-^'13 ? 





\<Aa^ 


^12 t^.l2 ? 


Ja. -4" '1 ^'22 ? 


0, 


R.3 




"^'12^33 "T '^''13 1^23 ? 


^■'12^33 + nz^^'W,- 


jLi'og 3 


Ri. 




- 2Ro, , 


— R23 5 


-- ^'12 , 


1 



Add the third column, multiplied by r^^ to, and subtract the fourth multiplied by 
To^ from, the second, and then subtract the first 



M (no, r^s) 



2iv' 



9 9 9T~>."; 



R + Rn , 


•-"^ '^-t^ll 1 


.]:v]3 , 





'^12*- 1*12 9 




0, 


Pk>s 


^^12^33 + ^13^23 , 


0, 


■'''23 > 


R,. 






— ^'la , 


1 



Divide out the 2 in the second column, add it to the firsts and subtract the third 
column multiplied by rig 

M (r,2, ^13) = - 



45/' 


R — r,sRi3 , 




Rl3 J 

0, 





0-fff-lo-lE" 


li,3 




^l2-tt33 1 


0, 


■tvas > 


.l-il3 




-— K'2:5 , 


_ n^ , 


"" ''12 ) 


1 



Add r.,0. times the last row to the first 



M (ri2, ris) 



4:W' 


R33? 


- 1, 


' 13? 


^'23 




-t^ •""" '^''23^x23? 


■»-i-22> 


0, 


J^23 




^"12^^33? 


0, 


ii23j 


R.8 




• t"V23r 


-- r.,^, 


_^ '?'l2> 


1 



Subtract 7*23 times the first row from the last, and remember that — R23 —'>'i'Mzs=^'>\J^Hii 



M (na, ^^13) 



4.n^ 


RsSj 


""^ Ij 


— ^3, 


^'*23 


o'io^pW 


\x — ?^23ti235 


"tt22j 


0, 


J-"t23 




'^^12tt333 


0, 


-i-'^'23j 


R,3 




^"rztiiss 


0, 


•ti'12? 


R,i 



Multiply the first row by $R_{22}$ and add it to the second; the determinant reduces
to the third order,



M (ri2, ^lo) = — 



4?^''^ 



9 9 91" 



1"2 



27^5 






]2^n^? 



'^'*1 3-1^225 

1^23, 

R'J2? 









Expanding this determinant from its first colixmnj remembering that R23^ii"~I^i3Ri2 



2Z^^P 



M (ri2, Trs) 



4ri' 



'2^2^2];)5 I '^'23^^ (i^ "~" '^28ti23 + J-^22 1-^33) + ^^12'^13 (A^22tlll I^12)-L^33 

"^ ^Vi'^lS \~L~^22-tM3 -1-^2-^^23) -t^isl^ 



Or, since R22R11 — R12 = R, £12^23 — ^22^13 = rv^, and U — ^3^23 + B22R33 
M (n,, ri,) — li-i-sji {2n322Ka3 - rr^r^, (Egg + r23R23 + ^'isTiis)}, 



8^^^ J . -p P 

2 2 2T>1 M 23*^22 1^33 



'^'l2^'l8 



R V- 



Hence, since 






AS.S. 



na^^'is 



we have by (xvi.) of Art. (6) and (xxxiv.) of Art. (9)

\[ R_{r_{12} r_{13}} = r_{23} - \frac{r_{12}r_{13}}{2}\,\frac{R}{R_{22}R_{33}} \qquad \text{(xxxvii.)}. \]
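The two new error correlations for three organs admit simple limiting checks. In the sketch below the form of (xxxvii.) is taken as R_{r12 r13} = r23 − r12 r13 R / (2 R22 R33). The display for the correlation of errors in σ1 and r23 is not legible in this copy, so the expression {2 r12 r13 − r23 (r12² + r13²)} / {√2 (1 − r23²)} used here is an assumption, re-derived by the usual large-sample argument, and should be compared with the printed equation. Function names are illustrative.

```python
import math

def corr_sigma1_r23(r12, r13, r23):
    """Error correlation of sigma1 and r23 (assumed form, see lead-in)."""
    return (2 * r12 * r13 - r23 * (r12 ** 2 + r13 ** 2)) / (math.sqrt(2) * (1 - r23 ** 2))

def corr_r12_r13(r12, r13, r23):
    """Error correlation of r12 and r13, equation (xxxvii.)."""
    R = 1 - r12 ** 2 - r13 ** 2 - r23 ** 2 + 2 * r12 * r13 * r23
    R22 = 1 - r13 ** 2
    R33 = 1 - r12 ** 2
    return r23 - r12 * r13 * R / (2 * R22 * R33)

if __name__ == "__main__":
    print(corr_sigma1_r23(0.6, 0.5, 0.4))
    print(corr_r12_r13(0.6, 0.5, 0.4))
```

Two limits serve as a check: if the third organ coincides with the first (r13 = 1, r23 = r12), the first expression falls back on r12/√2 of (xviii.); if the third organ coincides with the second (r23 = 1, r13 = r12), the second expression becomes unity, as it must.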



To complete the theory for the errors made in an investigation of the constants 
for a system of three correlated organs, we require to determine the probable error of 
a regression coefficient for a partial regression of a first organ on a second, the third 
organ being constant. This coefficient is given by 



3P12 



'^'t2 "~ '^Wi;; ^-^1 



»2 
^'23 *'^2 



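Take logarithmic differentials:

$$\frac{\delta\,{}_3\rho_{12}}{{}_3\rho_{12}} = \frac{\delta r_{12} - r_{23}\,\delta r_{13} - r_{13}\,\delta r_{23}}{r_{12} - r_{23}r_{13}} + \frac{2r_{23}\,\delta r_{23}}{1 - r_{23}^2} + \frac{\delta\sigma_1}{\sigma_1} - \frac{\delta\sigma_2}{\sigma_2}.$$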



Let this be squared and divided by n, and then the values found above for the
standard deviations of the errors in $r_{12}$, $r_{13}$, $r_{23}$, $\sigma_1$ and $\sigma_2$, and for the correlations of
errors in these quantities be substituted. After some lengthy algebraic reductions,
which it seems unnecessary to reproduce, there results



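$$\left(\frac{\Sigma_{{}_3\rho_{12}}}{{}_3\rho_{12}}\right)^2 = \frac{1}{n}\,\frac{1 - r_{23}^2 - r_{13}^2 - r_{12}^2 + 2r_{12}r_{23}r_{13}}{(r_{12} - r_{13}r_{23})^2},$$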






or 



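$$\Sigma_{{}_3\rho_{12}} = \frac{1}{\sqrt{n}}\,\frac{\sigma_1}{\sigma_2}\,\frac{\sqrt{1 - r_{23}^2 - r_{13}^2 - r_{12}^2 + 2r_{12}r_{13}r_{23}}}{1 - r_{23}^2} \qquad \text{(xxxviii.)}.$$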



The percentage probable error in a partial coefficient of regression is accordingly 

$$67.449\,\frac{\sqrt{R}}{\sqrt{n}\,(-R_{12})}.$$

Before discussing the significance of these quantitative results for three organs, it 
seems desirable to complete the general case by investigating the correlation between 
the errors made in the correlation coefficients of a first pair of organs and a second
different pair of organs. 
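
As a numerical illustration of (xxxviii.), the following Python sketch evaluates the partial regression coefficient, the standard deviation of its errors, and its percentage probable error. It assumes the reading of the formulae in which $R = 1 - r_{12}^2 - r_{13}^2 - r_{23}^2 + 2r_{12}r_{13}r_{23}$ and $-R_{12} = r_{12} - r_{13}r_{23}$; the function names and the sample values are illustrative assumptions and are not taken from the memoir.

```python
import math


def determinant_R(r12, r13, r23):
    """Determinant R of the three-organ correlation table."""
    return 1.0 - r12**2 - r13**2 - r23**2 + 2.0 * r12 * r13 * r23


def partial_regression(r12, r13, r23, s1, s2):
    """Regression of organ 1 on organ 2, organ 3 being held constant."""
    return (r12 - r13 * r23) / (1.0 - r23**2) * s1 / s2


def sd_partial_regression(r12, r13, r23, s1, s2, n):
    """Standard deviation of errors in the coefficient, read from (xxxviii.)."""
    R = determinant_R(r12, r13, r23)
    return (s1 / s2) * math.sqrt(R) / ((1.0 - r23**2) * math.sqrt(n))


def percentage_probable_error(r12, r13, r23, n):
    """67.449 * sqrt(R) / (sqrt(n) * (-R12)), with -R12 = r12 - r13 * r23."""
    R = determinant_R(r12, r13, r23)
    return 67.449 * math.sqrt(R) / (math.sqrt(n) * (r12 - r13 * r23))


if __name__ == "__main__":
    r = (0.5, 0.4, 0.3)  # hypothetical r12, r13, r23
    rho = partial_regression(*r, s1=2.0, s2=3.0)
    sd = sd_partial_regression(*r, s1=2.0, s2=3.0, n=1000)
    print(rho, sd, percentage_probable_error(*r, n=1000))
```

The percentage form does not involve $\sigma_1$ or $\sigma_2$, since both cancel when the standard deviation is divided by the coefficient itself.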

(11.) Case (ii.). Case of Four or more Organs. In the case of four or more organs
the only new probable error will be that of a partial regression coefficient, but this
can theoretically always be found by the method of the preceding paragraph, provided
we know all the error correlations. The only novel correlation among the errors
will be that of $r_{12}$ and $r_{34}$, and this we shall now proceed to investigate. The discovery of
an error correlation coefficient of this type completes the theory of the errors of
normal frequency constants.

Instead of evaluating $\Delta$ of (xxxiii.), which in the case of four organs appears to be
very laborious, we may proceed as follows:

If $\Delta$ be written in the form



^Vi(ri5 ^^(T^o-i? 



a 



(Ti<r.pJ 



o-iO-i 



• « 



^^o-^tT^^ ^^<T.i(r.i 



^^(TVVI^ ^^O-o' 



S'12^ 



^<ri)'^^ ^^'/'S.tj 



^^o-^o-i? ^^cTiv'igJ ^(Tirn^ 



» « 



<^^a-,a-,5 ^^o-4>'i;>3 ^^o-i'-i;.? 

^cr,/!,? ^ao/'i,? ^rj,ii,5 



^<r,7'345 <^^rja>3i9 ^'-JS'W' 



e a 



• « 



^^o-irg^j 



» b 



a 



0"4''342' 



a 



^13^31^ 






and M denote the minor of the corresponding $a$ in $\Delta$, we have by a well known
property of the determinant.

Divide by $\Delta$,



a 



M... 



<^4<^i 



A" ^ "^"'^ 



4f^3 



A 



4- a Miri2j_53) 4 
^ "^^"^^ A ^ 



M, 









M/,.., „.\ M{,.j^, „.^) M(,.j,, ^.,) 



Now ~<^aii5). , 



M 



('''13> ''34) 



* • • 



, are the correlations between the errors 



AS,. 2. ' AXr 2^ AS. 2 A2r 2,. 

made in the various quantities $\sigma_1, \sigma_2, \sigma_3, \sigma_4, r_{12}, r_{13}, r_{14}, r_{23}, r_{24}, r_{34}$, every one of which
is known by the previous investigations except that of $r_{12}$ and $r_{34}$, or $M_{(r_{12},\,r_{34})}/(\Delta\,\Sigma_{r_{12}}\Sigma_{r_{34}})$.
Hence the above equation will suffice to find the latter quantity, since $\Sigma_{r_{12}}$ and $\Sigma_{r_{34}}$ are
known. We have



a 



n Burn 



cr^a-i 



a 



0-40-3 



^^0-4^1, 






li 






n — 



n Egi^ai 



•^ 



O'l^^CTg It 



n 



cr: 



i+|). 



by (xxiv.) and (xxv.), 



J 



a 



<^4''l4 



0, 

n Ell 
<ji E 



n =z 0, 

04'13 -^ 



a 



O'i'^S 



0, by (xxvii.). 



0-4,/ '04 



n E.21 

O"! 1 



n Eo, 






a 



<^4''3i 



O", 



E ' 



by (xxvi.). 



Further 



A ^ri2 0-i'^n,^<ri — 9,, M2VJ- '^ is^, 



A 


- O-V,.^ 


■'■•-'-('■jai o-j) - 


». _5a 


A 


2n 


— ^" ■■■" ' W"^ 




M(n2,cr4) _ 


^'t 



2n 



7\., (1 — ria), by (xvi.) and (xviii.), 






M 


I'm 


ni) 




A 




M 


f''ia 


'''24) 




A 




M 


On, 


^34^ 



A 



S- {''^24 ('''U — ^24*^12) + ru (t\4 -- »'l2''l4)l, 



= ■:^ {''24(1 — 'I2) (1 — 'u) - h'^n'^iAs]' 
= — {»'i4(l — ^'12) (1 — 'lid — J ' ■12^^24 Ras}, 



\ IW3. (1 - *^i2) (1 - 'I4), by (xvi.). 



by (xxxvi.). 



J 



^ 



y by (xxxvii.), 



-* 



Now substitute and divide out by common factors, and we find 

{r.^ilu + ^',3,4) ri2 (1 — ^%) + rs4ll34 {^'23 h'n — '^12^3) + '''is {^'23 — 'rn>\M 

+ (R + E44) {»-24 ('^14 - ^'24^12) 4- ^14 (''-24 - ''^12^'l4)} 

+ 2Rh {^24(1 - r%){l — r\^) — I riariJIss} 
4- 2R2, {^',4 (I - r?2) (1 — A) — -| n^r^Ii,,} 
+ 2lWa,,^„^(l -ry(l -ri4) = (xxxix.). 



Now 






$$R = R_{44} + r_{14}R_{14} + r_{24}R_{24} + r_{34}R_{34},$$
$$0 = r_{14}R_{44} + R_{14} + r_{12}R_{24} + r_{13}R_{34},$$
$$0 = r_{24}R_{44} + r_{12}R_{14} + R_{24} + r_{23}R_{34}.$$

Multiply the third of these by $r_{14}$ and the second by $r_{24}$ and add; we have

$$0 = 2r_{14}r_{24}R_{44} + r_{24}R_{14} + r_{14}R_{24} + r_{12}(r_{14}R_{14} + r_{24}R_{24}) + (r_{13}r_{24} + r_{23}r_{14})R_{34}.$$

Hence, by the first,

$$r_{24}R_{14} + r_{14}R_{24} = R_{44}(r_{12} - 2r_{14}r_{24}) - r_{12}R + R_{34}(r_{12}r_{34} - r_{13}r_{24} - r_{23}r_{14}),$$

while

$$r_{14}R_{14} + r_{24}R_{24} = R - R_{44} - r_{34}R_{34}.$$



By means of these relations, let us get rid of the terms in $R_{14}$ and $R_{24}$ in (xxxix.)
above.

Re-arranging we have, after some reductions,



2ri2(l — r?2)R + 2ri2R33R44 + 1134(^4^^23 (ns -- n2^'23) + '^^'13(^3 ^ ^S^^s) 

+ 2(1— 7I2) (^12^34 — ri3?v4 -- 7%3^^i4) + 2ri4r24r34 -- T ^r,, (ri + r\^)} 
+ 2R34li.,,-3. (1 - ^^?2) (1 - rl) -= 0. 

Hence we can divide out by $R_{34}$, and accordingly,

= — buT^'iiriz — mi\-^ + nir\;{r.-,; — ri.Vjg) + 2(1 — rL)(r,,r34 ~ r^^Vu — r,3r,j,) 

+ 2fi4n//-34 ~ riani (r|j + r?,) + 2r,.R34 } . 
Noting that

$$R_{34} = -r_{34}(1 - r_{12}^2) + r_{13}r_{41} + r_{32}r_{42} - r_{12}(r_{13}r_{42} + r_{32}r_{41}),$$

we have, after substitution and rearranging,



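$$2R_{r_{12}r_{34}}(1 - r_{12}^2)(1 - r_{34}^2) = (r_{13} - r_{12}r_{23})(r_{24} - r_{23}r_{34}) + (r_{14} - r_{34}r_{13})(r_{23} - r_{12}r_{13}) + (r_{13} - r_{14}r_{34})(r_{24} - r_{12}r_{14}) + (r_{14} - r_{12}r_{24})(r_{23} - r_{24}r_{34}).$$

Or,

$$R_{r_{12}r_{34}} = \frac{(r_{13} - r_{12}r_{23})(r_{24} - r_{23}r_{34}) + (r_{14} - r_{34}r_{13})(r_{23} - r_{12}r_{13}) + (r_{13} - r_{14}r_{34})(r_{24} - r_{12}r_{14}) + (r_{14} - r_{12}r_{24})(r_{23} - r_{24}r_{34})}{2(1 - r_{12}^2)(1 - r_{34}^2)} \qquad \text{(xl.)}.$$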




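If we put 4 = 1 in this result and remember that $r_{11} = 1$, we find, after some
reductions,

$$R_{r_{12}r_{13}} = r_{23} - \tfrac{1}{2}\,r_{12}r_{13}\,\frac{1 - r_{12}^2 - r_{13}^2 - r_{23}^2 + 2r_{12}r_{13}r_{23}}{(1 - r_{12}^2)(1 - r_{13}^2)},$$

which agrees with (xxxvii.), and may be taken as a verification of this result.*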
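
The verification just described can be repeated numerically. The Python sketch below is an illustration only: it assumes that (xl.) has the four-product numerator written in the code and that (xxxvii.) is $r_{23} - \tfrac12 r_{12}r_{13}R/\{(1 - r_{12}^2)(1 - r_{13}^2)\}$; the function names and trial values are hypothetical.

```python
def corr_r12_r34(r12, r13, r14, r23, r24, r34):
    """Correlation between errors in r12 and r34, assumed form of (xl.)."""
    num = ((r13 - r12 * r23) * (r24 - r23 * r34)
           + (r14 - r34 * r13) * (r23 - r12 * r13)
           + (r13 - r14 * r34) * (r24 - r12 * r14)
           + (r14 - r12 * r24) * (r23 - r24 * r34))
    return num / (2.0 * (1.0 - r12**2) * (1.0 - r34**2))


def corr_r12_r13(r12, r13, r23):
    """Correlation between errors in r12 and r13, assumed form of (xxxvii.)."""
    R = 1.0 - r12**2 - r13**2 - r23**2 + 2.0 * r12 * r13 * r23
    return r23 - 0.5 * r12 * r13 * R / ((1.0 - r12**2) * (1.0 - r13**2))


if __name__ == "__main__":
    r12, r13, r23 = 0.5, 0.4, 0.3  # hypothetical trial values
    # Putting 4 = 1 means r14 -> r11 = 1, r24 -> r12, r34 -> r13.
    print(corr_r12_r34(r12, r13, 1.0, r23, r12, r13))
    print(corr_r12_r13(r12, r13, r23))
```

Both printed values should agree to rounding error if the two formulae are read as assumed.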

(12.) We may draw several conclusions from the results (xxxvi.), (xxxvii.), and (xl.).
(α.) While errors in the correlations of a first organ with a second and a third
have a correlation themselves of the first order, errors in the variation of a first 
organ and the correlation of two others, or in the correlation of two organs and in 
the correlation of a second two, have only correlation of the second order. Thus a 
selection of the correlation between two organs modifies the variation of all organs 
correlated with one or other or both of the first, but only in the second degree. 
Again, a selection of the correlation between two organs modifies the correlation of 
every other pair of organs, one or both of which are correlated with one or both of 
the first pair ; but this is only in the second degree. 

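(β.) If two organs be entirely uncorrelated, a random selection of the variation of
a third organ correlated with both of them will tend to generate correlation between
the hitherto uncorrelated organs, i.e., put $r_{23} = 0$ in (xxxvi.), and we have

$$R_{\sigma_1 r_{23}} = \sqrt{2}\,r_{12}r_{13}.$$

If a variation $\Delta\sigma_1$ be made in $\sigma_1$ the probable value of $\Delta r_{23}$ is

$$\Delta r_{23} = R_{\sigma_1 r_{23}}\,\frac{\Sigma_{r_{23}}}{\Sigma_{\sigma_1}}\,\Delta\sigma_1 = 2r_{12}r_{13}\,\frac{\Delta\sigma_1}{\sigma_1},$$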



which may clearly be of sensible magnitude. Thus correlation may be generated 
by selection of variation, and vice versa. 

(γ.) If two organs be each entirely uncorrelated with a third, yet a random
selection, which produces a correlation between one of these organs and the third, will
produce a correlation of the first order between the other of these organs and the
third, i.e., put $r_{12} = r_{13} = 0$ in (xxxvii.), we have

$$R_{r_{12}r_{13}} = r_{23},$$



a correlation of the first order between the probable changes. 

(δ.) Consider four organs of which the first is alone correlated with the third and
the second with the fourth, the third and fourth being themselves uncorrelated.
Then any random selection which produces a correlation between the first and
second will tend to produce a correlation between the third and fourth, i.e., if
$r_{12} = r_{14} = r_{23} = r_{34} = 0$, we still have from (xl.)

$$R_{r_{12}r_{34}} = r_{13}r_{24}.$$

* The probable error of a partial regression coefficient for p organs has not been worked out, owing
to the labour involved, but judging by the cases on pp. 245 and 260, it may safely be taken as
$67.449\,\sqrt{R}/\{\sqrt{n}\,(-R_{12})\}$, where R is now the determinant of the $p$th degree.

(ε.) We may further illustrate these principles by one or two hypothetical examples
drawn from actual organs.

Let the actual organs be (1) physique of father, (2) artistic sense of mother,
(3) physique of offspring, (4) artistic sense of offspring. Suppose in the general
population there is no correlation between physique of father and artistic sense of
mother, or between physique or artistic sense of parent, and artistic sense and
physique respectively of offspring. Then $r_{12} = 0$, $r_{14} = 0$, $r_{23} = 0$, and, presumably,
$r_{34} = 0$. Hence

$$R_{r_{12}r_{34}} = r_{13}r_{24}$$

is the product of the two coefficients for inheritance of physique from father to
child, and for inheritance of artistic sense from mother to child.

Now let a random selection be made out of the general population in which assortative
mating between physique in the male and artistic sense in the female presents
itself, i.e., let $\Delta r_{12}$ be sensible; then we have, most probably,

■n 2- 



12' 84 -^ 



or, a correlation between physique and artistic sense in the offspring will tend to be
developed. Generally, when $r_{34}$ and $r_{12}$ do not start from zero, we have,









or, any increase of sexual selection in a group tends to emphasise the correlation of 
the selected qualities in the offspring. 

Let the three characteristics be artistic sense (1) in a man, (2) in his mother, (3) 
in his wife. Then 

$$R_{r_{12}r_{13}} = -\tfrac{1}{2}\,r_{12}r_{13}\,\frac{1 - r_{12}^2 - r_{13}^2}{(1 - r_{12}^2)(1 - r_{13}^2)},$$

if we suppose $r_{23}$ to be zero.

Hence any selected group with a higher coefficient of maternal inheritance of 
heredity will have a less coefficient of sexual selection than the general population, 
and vice versa. The tendency is, of course, independent of the magnitude of r, and 
really of the particular character. Supposing likeness of faculty or character to be 
a rough measure of "sympathy," we might conclude for any population with inheritance
and sexual selection, that on the average a selected sub-group of men having
greater sympathy with mothers than the general population will have less sympathy 
with wives, and vice versa. 




(13.) Many like propositions may be stated with regard to the action of selection 
on the correlation of characters. They require but little modification to state them 
for artificial or natural selection, as they are here stated for what we have termed
random selection. The above will, however, suffice to indicate how every form of 
selection of variability or correlation influences in a manner capable of quantitative 
expression the variability and correlation of all other directly and indirectly corre- 
lated organs. Selection cannot be of service in altering one organ only, it alters at 
the same time the whole inter-relationship of a complex of organs. Evolution by 
natural selection can never be the change of one organ to suit a particular environ- 
ment ; it is the balance of advantage and disadvantage produced by the change of 
all organs involved in the attempt to select one of them. The moment the intimate 
correlation of organs in animal or plant life has been fully realised (and this realisation
owing to recent statistical investigations has become fairly easy), then the conception
of natural selection as moulding any single organ to what may be fittest to its
surroundings must be discarded. The selection of the "fittest" in one organ would
probably mean the selection of the unfit in other organs, and a general balance of
fitness in the complex of organs is all that is possible.*

IV. On the Probable Errors and the Coefficients of Correlation between
Errors made in the Determination of the Constants in the case of 
Skew Variation. 

(14.) The case of Skew Variation has been dealt with at length by one of the 
present authors in the second paper of this series. He has shown that in a great 
variety of cases it can be dealt with by a series of curves having three principal 
algebraical types, each defined by a certain number of constants. The probable 
errors of the determination of these constants were not then investigated, but it is 
clearly of great importance for the practical use of these curves to know how far 
these constants can, for any given number of observations, be depended upon to give 
an accurate measure of the skewness and its special features. At the same time an 
investigation of the probable errors of these constants leads us to a number of novel 
properties which are connected with the theory of evolution in the frequent case of 
skew variation. 

* Take, for example, result (xxxvi.); as far as terms of the second order are concerned $R_{\sigma_1 r_{23}} = \sqrt{2}\,r_{12}r_{13}$.
Hence, with positive correlation between three organs, the effect of trying to get a group very stable in
one organ, i.e., with a negative $\Delta\sigma_1$, is to reduce the correlation between every other pair of organs! In
other words, we have to reduce variation at the expense of correlation; increased stability of one organ
is gained at the expense of decreased stability in the inter-relationship of other organs. This may
possibly be illustrated by the long bones of the French, where the lesser variability of the male relative
to the female connotes also a lesser correlation. See Lee and Pearson: "On the Relative Variation
and Correlation in Civilised and Uncivilised Races," 'R. S. Proc.,' vol. 61, pp. 354-356.


We shall deal first with the skew curve of Type III. ('Phil. Trans.,' A, vol. 186,
p. 373), because its treatment is less complex and leads at once to some general
principles which must be borne in mind, whenever natural selection acts upon an
organ exhibiting skew variation.



(15.) Probable Errors and Error Correlations of the Constants of the Generalized
Probability Curve of Type $y = y_1 (1 + x/a)^p e^{-\gamma x}$.

This is the equation of the curve referred to its mean as origin, where

$$y_1 = \frac{n\gamma\,e^{-(p+1)}(p+1)^p}{\Gamma(p+1)} \qquad \text{(xli.)}.$$

Further, the moments about the centroid vertical are given by

$$\mu_2 = \frac{p+1}{\gamma^2}, \qquad \mu_3 = \frac{2(p+1)}{\gamma^3}, \qquad \mu_4 = \frac{3(p+1)(p+3)}{\gamma^4} \qquad \text{(xlii.)},$$

or,

$$\gamma = 2\mu_2/\mu_3, \qquad p = 4\mu_2^3/\mu_3^2 - 1 \qquad \text{(xliii.)}.$$

The criterion for the application of this curve to any frequency distribution is

$$6\mu_2^3 - 2\mu_2\mu_4 + 3\mu_3^2 = 0,$$

or, if we write $\beta_2 = \mu_4/\mu_2^2$, $\beta_1 = \mu_3^2/\mu_2^3$,

$$6 - 2\beta_2 + 3\beta_1 = 0 \qquad \text{(xliv.)}.$$

Lastly,

$$\mathrm{Sk.} = \text{the skewness} = \tfrac{1}{2}\,\mu_3/(\mu_2)^{3/2} = \frac{1}{\sqrt{p+1}} \qquad \text{(xlv.)},$$

and the modal frequency,

$$y_0 = \frac{n\gamma\,p^p}{e^p\,\Gamma(p+1)} \qquad \text{(xlvi.)}.$$

We require to know the probable errors of $p$, $\gamma$, $y_1$, $y_0$, $\sigma = \sqrt{\mu_2}$, $\mu_2$, $\mu_3$, $\mu_4$, and the
skewness. We must discover the best physical constants to describe such skew
frequencies and we shall at the same time succeed in deducing certain (we believe)
novel properties of normal frequency distributions as limiting cases of this skew type
of distribution.
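
The relations between the moments and the constants of this curve can be checked with a few lines of Python. This is a sketch under the assumption that (xlii.) to (xlv.) read $\mu_2 = (p+1)/\gamma^2$, $\mu_3 = 2(p+1)/\gamma^3$, $\mu_4 = 3(p+1)(p+3)/\gamma^4$, $\gamma = 2\mu_2/\mu_3$, $p = 4\mu_2^3/\mu_3^2 - 1$, and Sk. $= \tfrac12\mu_3/\mu_2^{3/2}$; the function names are illustrative and not the authors'.

```python
import math


def moments_from_constants(gamma, p):
    """Second, third and fourth moments about the mean, assumed (xlii.)."""
    mu2 = (p + 1.0) / gamma**2
    mu3 = 2.0 * (p + 1.0) / gamma**3
    mu4 = 3.0 * (p + 1.0) * (p + 3.0) / gamma**4
    return mu2, mu3, mu4


def constants_from_moments(mu2, mu3):
    """Invert the first two relations, assumed (xliii.)."""
    gamma = 2.0 * mu2 / mu3
    p = 4.0 * mu2**3 / mu3**2 - 1.0
    return gamma, p


def criterion(mu2, mu3, mu4):
    """6 - 2*beta2 + 3*beta1, which vanishes for a curve of this type."""
    beta1 = mu3**2 / mu2**3
    beta2 = mu4 / mu2**2
    return 6.0 - 2.0 * beta2 + 3.0 * beta1


def skewness(mu2, mu3):
    """Half the third moment over the cube of the standard deviation."""
    return 0.5 * mu3 / mu2**1.5


if __name__ == "__main__":
    mu2, mu3, mu4 = moments_from_constants(gamma=2.0, p=5.0)  # trial values
    print(constants_from_moments(mu2, mu3))
    print(criterion(mu2, mu3, mu4), skewness(mu2, mu3))
```

For observed moments the criterion will not vanish exactly; how far it may depart from zero by random sampling is the kind of question the probable errors of this section are meant to answer.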

"^ See ' Plail. Trans.,' A, vol. 186, pp. 3?8--4. 



(16.) The first stage in the investigation is to apply the general proposition of our



iLru. ^j tO 



$$\log y = \log n + \log\gamma - (p+1) + p\log(p+1) - \log\Gamma(p+1) + p\log\left(1 + \frac{\gamma x}{p+1}\right) - \gamma x.$$

We find:



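$$\frac{d^2(\log y)}{dx^2} = -\frac{p}{a^2}\,\frac{1}{(1 + x/a)^2},$$

$$\frac{d^2(\log y)}{dx\,dp} = \frac{1}{a}\left\{\frac{1}{1 + x/a} - \frac{p}{p+1}\,\frac{1}{(1 + x/a)^2}\right\},$$

$$\frac{d^2(\log y)}{dx\,d\gamma} = \frac{p}{p+1}\,\frac{1}{(1 + x/a)^2} - 1,$$

$$\frac{d^2(\log y)}{dp^2} = -\frac{d^2}{dp^2}\log\Gamma(p+1) + \frac{2}{p+1}\,\frac{1}{1 + x/a} - \frac{p}{(p+1)^2}\,\frac{1}{(1 + x/a)^2},$$

$$\frac{d^2(\log y)}{d\gamma^2} = -\frac{1}{\gamma^2}\left\{p + 1 - 2p\,\frac{1}{1 + x/a} + \frac{p}{(1 + x/a)^2}\right\},$$

$$\frac{d^2(\log y)}{dp\,d\gamma} = \frac{1}{\gamma}\left\{1 - \frac{2p+1}{p+1}\,\frac{1}{1 + x/a} + \frac{p}{p+1}\,\frac{1}{(1 + x/a)^2}\right\}.$$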

Let $I_p = y_1\int_{-a}^{\infty}(1 + x/a)^p e^{-\gamma x}\,dx$, then we easily find $I_{p-1} = \dfrac{p+1}{p}\,I_p$ and

$$I_{p-2} = \frac{(p+1)^2}{p\,(p-1)}\,I_p.$$

By aid of these we can at once write down the integrals of the above expressions
multiplied by y, since $n = I_p$. We find with the notation of p. 243,



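$$a_{11} = -\int_{-a}^{\infty} y\,\frac{d^2(\log y)}{dx^2}\,dx = \frac{n\gamma^2}{p-1},$$

$$a_{12} = -\int_{-a}^{\infty} y\,\frac{d^2(\log y)}{dx\,dp}\,dx = \frac{n\gamma}{p\,(p-1)},$$

$$a_{13} = -\int_{-a}^{\infty} y\,\frac{d^2(\log y)}{dx\,d\gamma}\,dx = -\frac{2n}{p-1},$$

$$a_{33} = -\int_{-a}^{\infty} y\,\frac{d^2(\log y)}{d\gamma^2}\,dx = \frac{2n\,(p+1)}{\gamma^2\,(p-1)},$$

$$a_{23} = -\int_{-a}^{\infty} y\,\frac{d^2(\log y)}{dp\,d\gamma}\,dx = -\frac{n\,(p+1)}{\gamma\,p\,(p-1)}.$$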




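Before we consider the determinant and its minors, we may note that

$$\log\Gamma(p+1) = \log\sqrt{2\pi} + (p + \tfrac{1}{2})\log p - p + \frac{B_1}{1\cdot 2\,p} - \frac{B_3}{3\cdot 4\,p^3} + \frac{B_5}{5\cdot 6\,p^5} - \ldots,$$

where the B's are the Bernoulli numbers. Hence

$$\frac{d^2}{dp^2}\log\Gamma(p+1) = \frac{2p-1}{2p^2} + \frac{B_1}{p^3} - \frac{B_3}{p^5} + \frac{B_5}{p^7} - \ldots,$$

and we have the convenient form,

$$a_{22} = n\left\{\frac{d^2}{dp^2}\log\Gamma(p+1) - \frac{2}{p} + \frac{1}{p-1}\right\} = \frac{n}{p^2}\left\{\frac{p+1}{2\,(p-1)} + S\right\},$$

where S is the semi-convergent series $B_1/p - B_3/p^3 + B_5/p^5 - $ &c.
Now we have at once,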



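$$\Delta = \begin{vmatrix} a_{11} & a_{12} & a_{13} \\ a_{12} & a_{22} & a_{23} \\ a_{13} & a_{23} & a_{33} \end{vmatrix} = n^3 \begin{vmatrix} \dfrac{\gamma^2}{p-1} & \dfrac{\gamma}{p\,(p-1)} & -\dfrac{2}{p-1} \\ \dfrac{\gamma}{p\,(p-1)} & \dfrac{1}{p^2}\left\{\dfrac{p+1}{2\,(p-1)} + S\right\} & -\dfrac{p+1}{\gamma\,p\,(p-1)} \\ -\dfrac{2}{p-1} & -\dfrac{p+1}{\gamma\,p\,(p-1)} & \dfrac{2\,(p+1)}{\gamma^2\,(p-1)} \end{vmatrix}.$$

Divide all three columns by $1/(p-1)$; the first row and first column by $\gamma$, and
then the last row and last column by $1/\gamma$; we find,

$$\Delta = \frac{n^3}{(p-1)^3} \begin{vmatrix} 1 & \dfrac{1}{p} & -2 \\ \dfrac{1}{p} & \dfrac{1}{p^2}\left\{\dfrac{p+1}{2} + (p-1)S\right\} & -\dfrac{p+1}{p} \\ -2 & -\dfrac{p+1}{p} & 2\,(p+1) \end{vmatrix}.$$

Divide the second row and column by $1/p$; add half the last row to the second,
and we find,

$$\Delta = \frac{n^3}{p^2\,(p-1)^3} \begin{vmatrix} 1 & 1 & -2 \\ 0 & (p-1)S & 0 \\ -2 & -(p+1) & 2\,(p+1) \end{vmatrix}.$$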




Subtract the first column from the second, and add the double of it to the last,
we deduce

$$\Delta = \frac{n^3}{p^2\,(p-1)^3} \begin{vmatrix} 1 & 0 & 0 \\ 0 & (p-1)S & 0 \\ -2 & -(p-1) & 2\,(p-1) \end{vmatrix},$$

or,

$$\Delta = \frac{2n^3 S}{p^2\,(p-1)} \qquad \text{(xlvii.)}.$$
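
The reduction of the determinant can be checked numerically. The Python sketch below is illustrative only: it assumes the elements $a_{11} = n\gamma^2/(p-1)$, $a_{12} = n\gamma/\{p(p-1)\}$, $a_{13} = -2n/(p-1)$, $a_{22} = (n/p^2)\{(p+1)/(2(p-1)) + S\}$, $a_{23} = -n(p+1)/\{\gamma p(p-1)\}$, $a_{33} = 2n(p+1)/\{\gamma^2(p-1)\}$, takes the first three coefficients of the series S as 1/6, 1/30 and 1/42, and compares the expanded determinant with $2n^3S/\{p^2(p-1)\}$. All names and trial values are hypothetical.

```python
def series_S(p):
    """First three terms of the semi-convergent series S (assumed values)."""
    b1, b3, b5 = 1.0 / 6.0, 1.0 / 30.0, 1.0 / 42.0
    return b1 / p - b3 / p**3 + b5 / p**5


def a_matrix(n, p, gamma, S):
    """Symmetric table of the a's, in the order (h, p, gamma)."""
    a11 = n * gamma**2 / (p - 1.0)
    a12 = n * gamma / (p * (p - 1.0))
    a13 = -2.0 * n / (p - 1.0)
    a22 = n / p**2 * ((p + 1.0) / (2.0 * (p - 1.0)) + S)
    a23 = -n * (p + 1.0) / (gamma * p * (p - 1.0))
    a33 = 2.0 * n * (p + 1.0) / (gamma**2 * (p - 1.0))
    return [[a11, a12, a13], [a12, a22, a23], [a13, a23, a33]]


def det3(m):
    """Determinant of a 3 x 3 table by expansion along the first row."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def delta_closed_form(n, p, S):
    """Closed form 2 n^3 S / (p^2 (p - 1))."""
    return 2.0 * n**3 * S / (p**2 * (p - 1.0))


if __name__ == "__main__":
    n, p, gamma = 100.0, 8.0, 1.5  # trial values
    S = series_S(p)
    print(det3(a_matrix(n, p, gamma, S)), delta_closed_form(n, p, S))
```

The agreement is algebraic in S, so it does not depend on how many terms of the series are retained.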



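The minors can now be written down at once.

$$M_{11} = \frac{2\,(p+1)\,n^2 S}{\gamma^2\,p^2\,(p-1)},$$

whence

$$\Sigma_h^2 = M_{11}/\Delta = \frac{p+1}{n\gamma^2} = \frac{\mu_2}{n} = \frac{\sigma^2}{n},$$

or

$$\Sigma_h = \sigma/\sqrt{n} \qquad \text{(xlviii.)}.$$

$$M_{22} = \frac{2n^2}{p-1},$$

whence

$$\Sigma_p^2 = M_{22}/\Delta = \frac{p^2}{nS},$$

or

$$\Sigma_p = \frac{p}{\sqrt{nS}} \qquad \text{(xlix.)}.$$

$$M_{33} = \frac{n^2\gamma^2}{p^2\,(p-1)}\,(\tfrac{1}{2} + S),$$

whence

$$\Sigma_\gamma^2 = M_{33}/\Delta = \frac{\gamma^2}{2n}\left(1 + \frac{1}{2S}\right),$$

or

$$\Sigma_\gamma = \frac{\gamma}{\sqrt{2n}}\sqrt{1 + \frac{1}{2S}} \qquad \text{(l.)}.$$

$$M_{23} = \frac{n^2\gamma}{p\,(p-1)},$$

whence

$$R_{p\gamma} = M_{23}/(\Delta\,\Sigma_p\Sigma_\gamma) = \sqrt{\frac{1}{1 + 2S}} \qquad \text{(li.)}.$$

$$M_{12} = 0,$$

whence

$$R_{hp} = 0 \qquad \text{(lii.)}.$$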






M,3 



Lastly, 
wbence 

This completes the direct series of probable errors and error correlations. By aid 
of the above correlations and standard deviations we can now find a further series. 

From (xlii.) we have for the standard deviation $\sigma$ (about the mean), $\sigma^2 = \mu_2 = (p+1)/\gamma^2$,
or $\sigma = \sqrt{p+1}/\gamma$. Hence

$$\frac{\Delta\sigma}{\sigma} = \tfrac{1}{2}\,\frac{\Delta p}{p+1} - \frac{\Delta\gamma}{\gamma} \qquad \text{(liv.)}.$$



Square both sides of this, divide by n and sum, we have at once from the definition
of a coefficient of correlation






Hence, using (xlix.), (l.), and (li.), we find, after reductions,

$$\Sigma_\sigma = \frac{\sigma}{\sqrt{2n}}\sqrt{1 + \frac{1}{2\,(p+1)^2 S}} \qquad \text{(lv.)}.$$



Multiply (liv.) by $\Delta h$, sum and divide by n, we have
Whence, by (lii.),



-V 

cr 



or, reducing by (l.), (liii.), and (lv.),



_ //^2 \_ 1 



^Iw — "^ \/ ( ' "V 1 ) // 1 X , . . » . (IVJ,)* 



)^s, 



Next, if Sk. be the skewness, we have from (xlv.)

$$\Delta\,\mathrm{Sk.} = -\tfrac{1}{2}\,\frac{\Delta p}{(p+1)^{3/2}},$$

or

$$\Sigma_{\mathrm{Sk.}} = \frac{p}{2\,(p+1)^{3/2}\sqrt{nS}} \qquad \text{(lvii.)}.$$






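Similarly

$$\Sigma_h\,\Sigma_{\mathrm{Sk.}}\,R_{h\,\mathrm{Sk.}} = -\tfrac{1}{2}\,\frac{\Sigma_h\,\Sigma_p\,R_{hp}}{(p+1)^{3/2}} = 0, \quad \text{since } R_{hp} = 0,$$

or

$$R_{h\,\mathrm{Sk.}} = 0 \qquad \text{(lviii.)}.$$

We easily obtain, by multiplying (liv.) by $\Delta p$, the result

$$R_{p\sigma} = -\frac{1}{\sqrt{1 + 2\,(p+1)^2 S}} \qquad \text{(lix.)}.$$

We can now obtain $R_{\sigma\,\mathrm{Sk.}}$, for every $\Delta\,\mathrm{Sk.}$ is negatively proportional to the corresponding
$\Delta p$. Hence

$$R_{\sigma\,\mathrm{Sk.}} = \frac{1}{\sqrt{1 + 2\,(p+1)^2 S}} \qquad \text{(lx.)}.$$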



We next pass to the mean and modal frequencies as given by (xli.) and (xlvi.).
We have, by taking logarithmic differentials,

$$\frac{\Delta y_1}{y_1} = \frac{\Delta\gamma}{\gamma} - J\,\Delta p,$$

where

$$J = \frac{1}{p+1} + \frac{d}{dp}\{\log\Gamma(p+1)\} - \log(p+1).$$

* Remembering that, by (xlv.), $p = 1/(\mathrm{Sk.})^2 - 1$, we easily deduce

$$\Sigma_{\mathrm{Sk.}} = \sqrt{\frac{3}{2n}}\;\frac{1}{\sqrt{1 + 3\,(\mathrm{Sk.})^2 + \frac{29}{5}\,(\mathrm{Sk.})^4 + 9\,(\mathrm{Sk.})^6}}$$

as far as the 7th power of the skewness inclusive.

Very generally the probable error of the skewness may be taken as equal to

$$.67449\,\sqrt{\frac{3}{2n}}\;\frac{1}{\sqrt{1 + 3\,(\mathrm{Sk.})^2}},$$

and it is always less than $\sqrt{3/(2n)}$, its value in the case of a normal frequency.






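Squaring, introducing the standard deviations, and rearranging, we find

$$\frac{\Sigma_{y_1}^2}{y_1^2} = \frac{\Sigma_\gamma^2}{\gamma^2}\,(1 - R_{p\gamma}^2) + \left(J\,\Sigma_p - R_{p\gamma}\,\frac{\Sigma_\gamma}{\gamma}\right)^2 = \frac{1}{2n}\left\{1 + \frac{2}{S}\,(Jp - \tfrac{1}{2})^2\right\}.$$

We must now evaluate $Jp - \tfrac{1}{2}$. This is easily shown from the Bernoulli
number expansion for $\log\Gamma(p+1)$ to be given by

$$Jp - \tfrac{1}{2} = \frac{p}{p+1} - p\log\frac{p+1}{p} - T,$$

where

$$T = \frac{B_1}{2p} - \frac{B_3}{4p^3} + \frac{B_5}{6p^5} - \ldots$$

Thus we determine

$$\Sigma_{y_1} = \frac{y_1}{\sqrt{2n}}\left\{1 + \frac{2}{S}\left(p\log\frac{p+1}{p} - \frac{p}{p+1} + T\right)^2\right\}^{1/2} \qquad \text{(lxi.)}.$$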



Expanding the expression in brackets in inverse powers of p we find

$$\Sigma_{y_1} = \frac{y_1}{\sqrt{2n}}\left\{1 + \frac{49}{12p} - \frac{28}{3p^2} + \frac{248}{15p^3} - \ldots\right\}^{1/2} \qquad \text{(lxii.)}.$$



Eesult (Ixi.), however, with S and T calculated to ljp\ gives a better value than 
(Ixii.). 

To find the modal frequency error we must take the logarithmic differential 
of (xlvi.) and proceed in the same way. We find almost at once 






$$\frac{\Delta y_0}{y_0} = \frac{\Delta\gamma}{\gamma} - \frac{\Delta p}{p}\left(\tfrac{1}{2} - T\right).$$



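Whence on squaring and completing the square of the factor of $\Sigma_p$, we find

$$\frac{\Sigma_{y_0}^2}{y_0^2} = \frac{1}{2n}\left\{1 + \frac{2T^2}{S}\right\},$$

and

$$\Sigma_{y_0} = \frac{y_0}{\sqrt{2n}}\left\{1 + \frac{2T^2}{S}\right\}^{1/2} \qquad \text{(lxiii.)}.$$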


Expanding as far as powers of 1/p^ exchisive we obtain 



$$\Sigma_{y_0} = \frac{y_0}{\sqrt{2n}}\left\{1 + \frac{1}{12p}\right\}^{1/2} \qquad \text{(lxiv.)},$$

a very simple expression for the probable error of the modal frequency, $y_0$.




In like manner we shall now determine the probable errors of the moments and 
their error correlations. 

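Take the logarithmic differentials of (xlii.):

$$\frac{\Delta\mu_2}{\mu_2} = \frac{\Delta p}{p+1} - \frac{2\,\Delta\gamma}{\gamma},$$

$$\frac{\Delta\mu_3}{\mu_3} = \frac{\Delta p}{p+1} - \frac{3\,\Delta\gamma}{\gamma},$$

$$\frac{\Delta\mu_4}{\mu_4} = \frac{2\,(p+2)\,\Delta p}{(p+1)(p+3)} - \frac{4\,\Delta\gamma}{\gamma}.$$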



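Squaring each of these in succession and using the known values of $\Sigma_p$, $\Sigma_\gamma$, $R_{p\gamma}$,
we find

$$\Sigma_{\mu_2} = \mu_2\,\frac{2}{\sqrt{2n}}\sqrt{1 + \frac{1}{2S\,(p+1)^2}} \qquad \text{(lxv.)},$$

$$\Sigma_{\mu_3} = \mu_3\,\frac{3}{\sqrt{2n}}\sqrt{1 + \frac{(p+3)^2}{18S\,(p+1)^2}} \qquad \text{(lxvi.)},$$

$$\Sigma_{\mu_4} = \mu_4\,\frac{4}{\sqrt{2n}}\sqrt{1 + \frac{(2p+3)^2}{2S\,(p+1)^2(p+3)^2}} \qquad \text{(lxvii.)}.$$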



Now multiply $\Delta\mu_2/\mu_2$ by $\Delta\mu_3/\mu_3$ and we find, after some reductions,

$$R_{\mu_2\mu_3} = \frac{1 + \dfrac{p+3}{6S\,(p+1)^2}}{\sqrt{1 + \dfrac{1}{2S\,(p+1)^2}}\;\sqrt{1 + \dfrac{(p+3)^2}{18S\,(p+1)^2}}} \qquad \text{(lxviii.)}.$$



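Next multiply $\Delta\mu_2/\mu_2$ by $\Delta\mu_4/\mu_4$, and we ultimately have

$$R_{\mu_2\mu_4} = \frac{1 + \dfrac{2p+3}{2S\,(p+1)^2(p+3)}}{\sqrt{1 + \dfrac{1}{2S\,(p+1)^2}}\;\sqrt{1 + \dfrac{(2p+3)^2}{2S\,(p+1)^2(p+3)^2}}} \qquad \text{(lxix.)}.$$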



Lastly, multiplying $\Delta\mu_3/\mu_3$ by $\Delta\mu_4/\mu_4$, we deduce, after some reductions,

$$R_{\mu_3\mu_4} = \frac{1 + \dfrac{2p+3}{6S\,(p+1)^2}}{\sqrt{1 + \dfrac{(p+3)^2}{18S\,(p+1)^2}}\;\sqrt{1 + \dfrac{(2p+3)^2}{2S\,(p+1)^2(p+3)^2}}} \qquad \text{(lxx.)}.$$






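We may add to these results the values of $\Sigma_{\beta_1}$ and $\Sigma_{\beta_2}$, where $\beta_1$ and $\beta_2$ are given
by (xliv.); we find

$$\Sigma_{\beta_1} = \frac{4p}{\sqrt{n}\,(p+1)^2\sqrt{S}}, \qquad \Sigma_{\beta_2} = \frac{6p}{\sqrt{n}\,(p+1)^2\sqrt{S}} \qquad \text{(lxxi.)}.$$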



The distances from the mode to the mean, d, and from the mean to the end of the
range, a, are given by



Hence 



And further



$$d = 1/\gamma \quad \text{and} \quad a = \frac{p+1}{\gamma}.$$



'd 



d 




L "4^" 



y 



1 N 



9 ^ 






a 



v/(2?0 




'■ + 2S(jp + l)^ 



R 



/ 



9 



cWt 



vv^ 



E 



f(A. 




- s 

2 
2] + I. 



S- 



V \ 



3 



1\" 



p + 1^ 



+ s 



_^ 



(Ixxii.) 



t i » I i.X.A.Itl. I. 



. (Ixxiv.). 



The results (xlvii.) to (lxxiv.) must be now considered at length.

(17.) (α.) The frequency curve of the type considered is fully described by the
three constants, the mean, $\gamma$, and $p$. But, since any three constants would do
equally well (for example, what may be termed the three physical constants: mean,
standard deviation (or variation), and skewness), it becomes of some importance to
inquire which constants have the least percentage of probable error.

Now (xlviii.) shows us that the probable error in the mean is precisely the same
as in the case of the normal curve, namely $.67449\,\sigma/\sqrt{n}$.

Thus, the percentage error in the mean

$$= \frac{.67449}{\sqrt{n}} \times \frac{100\,\sigma}{\text{mean}} = \frac{.67449}{\sqrt{n}} \times \text{coefficient of variation},$$

and will certainly be small whenever the coefficient of variation is small. Its value
is quite independent of the order of $p$.




On the other hand, the percentage probable errors of $p$ and $\gamma$ are from (xlix.)
and (l.)

$$\frac{67.449}{\sqrt{n}}\,\frac{1}{\sqrt{S}} \quad \text{and} \quad \frac{67.449}{\sqrt{2n}}\sqrt{1 + \frac{1}{2S}} \quad \text{respectively.}$$

Here S is equal to the series $B_1/p - B_3/p^3 + B_5/p^5 - \ldots$, which tends to zero as
$p$ increases.

The errors in $p$ and $\gamma$ thus tend to increase indefinitely as $p$ increases. It may then
be asked how the form of the curve can be determined with any degree of accuracy.
The answer is simple: Equation (li.) shows us that the correlation between errors in
$p$ and $\gamma$ tends, as $p$ increases, to become "perfect," i.e., unity. But as $p$ increases
indefinitely, it has been shown that the frequency curve of this type passes over into
the normal form.* It is the high correlation between errors in $p$ and $\gamma$ which
renders the curve, when plotted to observations, such an excellent fit. If the errors
in $p$ and $\gamma$ were independent, this would not be so. At the same time it renders
$p$ and $\gamma$ unsuitable for tabulation as physical or biological constants of the frequency.

Turning to (lv.) and (lvii.) we see that the standard deviation, $\sigma$, and the skewness,
Sk., are suitable constants for tabulation. Their probable errors do not tend to
increase indefinitely with $p$, and will always be small, if $n$ be large.

Hence a frequency distribution of this type is best defined by its mean $h$, its
standard deviation $\sigma$, and its skewness Sk. These are constants characteristic of
the group, for they are given with small probable errors. If it be desired to draw
the form of the frequency-curve, then its algebraic constants, $p$ and $\gamma$, may be found
from

$$p + 1 = \frac{1}{(\mathrm{Sk.})^2}, \qquad \gamma = \frac{1}{\sigma\,\mathrm{Sk.}},$$

and the possibly considerable errors in $p$ and $\gamma$ will not vary largely its actual shape.
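
The contrast between the algebraic and the physical constants can be made concrete. The following Python sketch is an illustration under stated assumptions: it takes $p + 1 = 1/(\mathrm{Sk.})^2$ and $\gamma = 1/(\sigma\,\mathrm{Sk.})$, uses the percentage probable errors $67.449/\sqrt{nS}$ for $p$, $(67.449/\sqrt{2n})\sqrt{1 + 1/(2S)}$ for $\gamma$, and $(67.449/\sqrt{2n})\sqrt{1 + 1/\{2(p+1)^2S\}}$ for $\sigma$, and truncates S after three terms with coefficients 1/6, 1/30, 1/42. Function names and trial values are hypothetical.

```python
import math


def series_S(p):
    """First three terms of the semi-convergent series S (assumed values)."""
    return 1.0 / (6.0 * p) - 1.0 / (30.0 * p**3) + 1.0 / (42.0 * p**5)


def algebraic_constants(sigma, sk):
    """Recover p and gamma from the standard deviation and the skewness."""
    p = 1.0 / sk**2 - 1.0
    gamma = 1.0 / (sigma * sk)
    return p, gamma


def pct_error_p(n, p):
    """Percentage probable error of p."""
    return 67.449 / math.sqrt(n * series_S(p))


def pct_error_gamma(n, p):
    """Percentage probable error of gamma."""
    return 67.449 / math.sqrt(2.0 * n) * math.sqrt(1.0 + 1.0 / (2.0 * series_S(p)))


def pct_error_sigma(n, p):
    """Percentage probable error of the standard deviation."""
    s = series_S(p)
    return 67.449 / math.sqrt(2.0 * n) * math.sqrt(1.0 + 1.0 / (2.0 * (p + 1.0)**2 * s))


if __name__ == "__main__":
    n, p = 1000, 50.0  # trial values
    print(pct_error_p(n, p), pct_error_gamma(n, p), pct_error_sigma(n, p))
```

For a fairly large $p$ the first two percentages are many times the third, which is the reason given in the text for tabulating $\sigma$ and Sk. rather than $p$ and $\gamma$.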

(β.) The nature of the probable errors of the other allied constants may now be
considered. The mean and modal frequencies per unit variation of organ, or $y_1$ and
$y_0$, are seen by (lxii.) and (lxiv.) to have small percentage probable errors, and are,
therefore, good for use as characteristic physical or biological constants. But it
should be noted that the modal frequency is considerably more exact for moderate
values of $p$ than the mean frequency. For example it would be somewhat better to
tabulate the modal than the mean frequency of the barometer as a physical
characteristic of climate.

The probable errors of the distances from the mean to the mode and from the
mean to the terminal of the range are given by (lxxii.) and (lxxiii.). Since
$d = 1/\gamma = \sigma/\sqrt{1 + p}$, we may write the first

$$\Sigma_d = \frac{\sigma}{\sqrt{2n}}\sqrt{\frac{1}{1 + p} + \frac{1}{2\,(1 + p)\,S}}.$$

* ' Phil. Trans.,' A, vol. 186, p. 374. 


This remains finite, even if $p$ be indefinitely great. On the other hand, the
probable error of $a$, and even its percentage probable error, becomes indefinitely great
with $p$. It is to be noted that $a$ in this case becomes infinite.

(γ.) Results (lxv.) to (lxvii.) give the probable errors of the second, third, and
fourth moments. It will be seen that roughly, for a large $p$, the percentage error of
the fourth moment is about double that of the second. It might thus appear, at
first sight, safer to work with the second than with the fourth, but this is by no
means necessarily the case, for to deduce any quantity from one or the other they
must be reduced to the same order. For example, the square root of $\mu_2$ must be
compared with the fourth root of $\mu_4$, and the probable errors of $\sqrt{\mu_2}$ and $(\mu_4)^{1/4}$ will
be sensibly of the same order.

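Remembering that $\sigma^3 = (p+1)^{3/2}/\gamma^3 = \tfrac{1}{2}\,\mu_3\sqrt{p+1}$, we may write

$$\Sigma_{\mu_3} = \frac{6\sigma^3}{\sqrt{2n}}\sqrt{\frac{1}{p+1} + \frac{(p+3)^2}{18S\,(p+1)^3}}.$$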



This tends to a finite limit as $p$ increases indefinitely, and we conclude that the
probable error of $\mu_3$ is always finite, and will in general be a small fraction of the
cube of the standard deviation. The above remarks are a justification for the use of
higher moments in frequency calculations.

Equations (lxviii.) to (lxx.) give the error-correlations between the first three
moments. They show that an error in the value of one of these moments will most
probably lead to an error in the other two. We see that for $p$ fairly large $R_{\mu_2\mu_4}$ is a
large correlation, while $R_{\mu_2\mu_3}$ and $R_{\mu_3\mu_4}$ are small. In other words a random selection
of an even moment makes a far larger correlated change in another even moment
than in an odd moment. If $p$ increase indefinitely we find the ratio $R_{\mu_2\mu_3}/R_{\mu_3\mu_4}$
approaches the value 2/3; in other words, $\mu_3$ is more closely correlated to the higher
moment $\mu_4$ than to the lower moment $\mu_2$.
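
The limiting behaviour of the three error-correlations can be illustrated numerically. The Python sketch below assumes that (lxviii.) to (lxx.) all share the form $(1 + e_j e_k/2S)/\sqrt{(1 + e_j^2/2S)(1 + e_k^2/2S)}$ with $e_2 = 1/(p+1)$, $e_3 = (p+3)/\{3(p+1)\}$ and $e_4 = (2p+3)/\{(p+1)(p+3)\}$, which is algebraically the same as writing the three fractions out separately; the helper names, this shorthand, and the truncation of S are assumptions made for the illustration.

```python
import math


def series_S(p):
    """First three terms of the semi-convergent series S (assumed values)."""
    return 1.0 / (6.0 * p) - 1.0 / (30.0 * p**3) + 1.0 / (42.0 * p**5)


def moment_error_correlations(p):
    """Return (R_mu2mu3, R_mu2mu4, R_mu3mu4) for a given p."""
    s = series_S(p)
    e2 = 1.0 / (p + 1.0)
    e3 = (p + 3.0) / (3.0 * (p + 1.0))
    e4 = (2.0 * p + 3.0) / ((p + 1.0) * (p + 3.0))

    def corr(ej, ek):
        # Shared form of the three correlations in the shorthand e_j, e_k.
        num = 1.0 + ej * ek / (2.0 * s)
        den = math.sqrt((1.0 + ej**2 / (2.0 * s)) * (1.0 + ek**2 / (2.0 * s)))
        return num / den

    return corr(e2, e3), corr(e2, e4), corr(e3, e4)


if __name__ == "__main__":
    for p in (10.0, 1.0e3, 1.0e6):  # trial values of p
        r23, r24, r34 = moment_error_correlations(p)
        print(p, r23, r24, r34, r23 / r34)
```

As $p$ grows the second value tends to unity, the first and third tend to zero, and their ratio tends to 2/3, in agreement with the statements of the preceding paragraph.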

Formulae (lxxi.) give the probable errors of the useful constants $\beta_1 = \mu_3^2/\mu_2^3$ and
$\beta_2 = \mu_4/\mu_2^2$. We see that they are small and approach the value zero as $p$ is
indefinitely increased.

(δ.) Let us restate the formulae for $p$ indefinitely great, i.e., for the normal curve of
frequency.

In this case we have $\mu_2 = \sigma^2$, $\mu_3 = 0$, $\mu_4 = 3\sigma^4$, $\beta_1 = 0$, $\beta_2 = 3$, skewness = 0,
mean and mode coincide. Several of these zero quantities, however, tend to have
definite probable errors.

We have 






^p 



^-= \/^' 



p 1 



s, = V (^i) ^ = ^^4-. 



The first, second, fourth and fifth of these results are old ; the rest appear to be 
novel and of some importance. 

In the first place we notice that given a population which is really normal, we
should not expect a random selection to exhibit all the signs of normality. Its
skewness will differ from zero with a probable error of $.67449\sqrt{3/(2n)}$. For example,
in a random selection of 600 from a normal population, the skewness will be as likely
to exceed as to fall short of .034. Hence an exhibition of skewness of less than once
to twice $.67449\sqrt{3/(2n)}$ must not in itself merely be taken to indicate an absence of
normality in a general population.

Again, in a random selection from a general population, the mode will differ
from the mean, even if the population be normal, with a probable error of
$.67449\sqrt{3/(2n)}\,\sigma$. Thus, in a population of 600, a difference between the mean and
the mode of $.034\,\sigma$ should not be taken to indicate want of normality. Generally,
the divergence between mean and mode in a population must at least exceed once to
twice $.67449\sqrt{3/(2n)}\,\sigma$ for us to be able to argue on this ground alone that the
population has not a normal distribution.

Again, the third moment not being zero, but having a value of once or twice
$.67449 \times 2\sqrt{3/(2n)}\,\sigma^3$, is not in itself an argument for skew frequency.

The above statements are an important addition to the second memoir of this 
series ; they give us the criterion, there wanting, to distinguish between a skewness 
which is characteristic of a population and one which might arise by the random 
selection of a population of the given size out of a larger, but really normal, popula- 
tion. 

(e.) We may now note the exceedingly interesting conclusions which these results 
have for the theory of evolution. 

Suppose an organ to have, as so many do, skew variation, then we notice

(i.) Any selection of the organ by size tends to alter its variability, but not its skewness; this follows from (lvi.) and (lviii.). Further, if, as we have supposed, the range be limited on the side of dwarf organs, then any increase of size means a decrease of variability, and vice versa.

(ii.) Any selection of variability is a selection of skewness; this follows from (lx.). If a selection be made from a general population, which has less variability, then it will tend to greater normality. In other words, it would appear that stringent selection tends to generate normal distribution. Thus, if out of a skewly distributed population we make a number of random selections, that with the least variability will be most normal. Select at random again out of this latter selection, and the least variable group will again be the most normal, and so on.

Now take a problem of this kind involving group, and not individual, selection. Let a large general population break itself up at random into groups, and let us suppose these groups, not individuals among them, to carry on a struggle for existence: an inter-group, not an intra-group, struggle. Then, if it be an advantage to a group that its members shall be among themselves close to a type, i.e., less variable, then the more normal groups will survive, for variability is positively correlated with skewness. Now suppose each group to be periodically subdivided at random into new groups (the mathematical description of some process of group reproduction); then we see how normal distribution may be a result of a stringent inter-group selection of groups whose individuals have the closest resemblance to each other, that is, intra-group resemblance.

(iii.) Any selection of the size of an organ produces by (lxxiv.) an alteration in the distances between the mean and the mode, and between the mean and the end of the range.

A random selection which has its mean larger than that of the general population, will, if the mode be on the dwarf side of the mean, tend to have its mode and mean nearer together than are the mode and mean of the general population, while on the other hand, to raise the mean is to raise the dwarf limit to the range.

A considerable number of like results might be stated, but the above will be sufficient to emphasize the general principle that a random and a fortiori an artificial selection of the size of an organ, does, whenever its distribution is skew, influence in a definite manner the variability of the organ. It is quite safe to assert that it will also influence the correlation of organs. When we notice how wide-spread is skew variation in nature, we may assert that the general rule is that no modification can be made in any of the features (mean sizes, variabilities and correlations) of a group of organs without at the same time modifying all the others.






* A paper has recently been published by Messrs. Davenport and Bullard in the 'Proceedings of the American Academy of Science' (see Illustration II. below) on "The Variation and Correlation of the Glands in the Legs of Swine." Unfortunately the authors have overlooked the markedly skew character




As a result of Articles (15) and (16), it is possible to use the frequency curve of type $y = y_0 (1 + x/a)^{p} e^{-\gamma x}$ with as much certainty as to the nature and magnitude of the errors made in the constants as has hitherto been possible in the case of the normal distribution $y = \frac{n}{\sqrt{2\pi}\,\sigma} e^{-x^2/(2\sigma^2)}$. The method has been exemplified numerically in twenty-three cases in a memoir on the "Variation of Barometric Frequency" (see 'Phil. Trans.,' A, vol. 190, p. 423). It may not, however, be amiss to illustrate it further in a special case having closer bearings on the theory of evolution.



(18.) Numerical Illustration: Incidence of Enteric Fever.

In a memoir in the 'Phil. Trans.,' A, vol. 186, p. 391, it is shown that the curve

$$y = y_0\,(1 + x/3.428094)^{3.673042}\, e^{-1.071453\,x}$$

closely represents the distribution with age of 8,689 cases of enteric fever received into the Metropolitan Asylums Board Fever Hospitals. The unit of x is five years, and the origin is the mode at 14.3025 years. The criterion is not very nearly zero, although small, but the curve is graphically a good fit (see Plate 12, fig. 9).
The following are the numerical values of all the constants:

Mean = 18.9691 years.
Mode = 14.3025 years.
$d$ = mean − mode = 0.933313 unit.
Sk = skewness = 0.462594.
$\gamma$ = 1.071453.    $p$ = 3.673042.
$a$ = 3.428094.
$y_0$ = modal frequency = 1894.57.
$y_1$ = mean frequency = 1687.80.
$\sigma$ = standard deviation = 2.01756 units = 10.0878 years.
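
The constants just listed are not independent, and a short sketch can confirm that they hang together. It assumes the usual relations $d$ = (mean − mode) in units of five years, Sk = $d/\sigma$, and $p = \gamma a$; these relations are our reading of the definitions and are checked here only against the printed figures.

```python
# Constants quoted for the enteric fever curve (the unit of x is five years).
mean_years, mode_years = 18.9691, 14.3025
unit_years = 5.0
gamma, a, p = 1.071453, 3.428094, 3.673042
sigma_units = 2.01756

d_units = (mean_years - mode_years) / unit_years  # distance from mode to mean, in units
skewness = d_units / sigma_units                  # assumed definition: Sk = d / sigma
p_from_gamma_a = gamma * a                        # assumed relation: p = gamma * a

print(round(d_units, 5), round(skewness, 4), round(p_from_gamma_a, 5))
```

The three computed values agree with the printed $d$, Sk and $p$ to within rounding of the last figure.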

From these the numerical values of the probable errors and of the correlations 
between the errors of the constants were found by the processes indicated and the 
formulae given above. 

We found 

T = 0.022525, S = 0.044735,

whence 

of the distribution. It is, however, clear from their tables and plate that no selection could be made of the absolute number of glands without altering the variability of the gland distribution and the correlation between different systems of glands.




Constant.    Probable Error.    Percentage Probable Error.

$p$          0.125659           3.4211
$\gamma$     0.019130           1.7854
      Correlation of errors in $p$ and $\gamma$ = 0.9581
$a$          0.061202           1.7853
$y_0$        9.8029             0.5174
$y_1$        23.5465            1.3951
mean         0.014600 = 0.073 year,
mode         0.024126 = 0.121 year.

These are the constants which determine the position and algebraical equation to the frequency curve, and we see at once that they are all determined with a close degree of accuracy. The largest percentage probable error is in $p$, but this is under 3.5 per cent., and, owing to the high correlation between $p$ and $\gamma$, a much larger error would produce no sensible change in the shape of the curve.
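
The percentage probable errors are simply the probable errors expressed as a percentage of the constants themselves. The sketch below recomputes them from the quoted values; the dictionary layout and names are ours.

```python
# (value of constant, probable error) as quoted for the enteric fever curve.
constants = {
    "p": (3.673042, 0.125659),
    "gamma": (1.071453, 0.019130),
    "a": (3.428094, 0.061202),
    "y0": (1894.57, 9.8029),
    "y1": (1687.80, 23.5465),
}

# Percentage probable error = 100 x probable error / value.
percent = {name: 100.0 * pe / value for name, (value, pe) in constants.items()}

for name, pct in percent.items():
    print(name, round(pct, 4))
```

The largest of these, that of $p$, comes out a little above 3.4 per cent.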

Two important facts may also be drawn from these results, which indeed follow from the general formulae, namely:

(i.) The position of the mean is sensibly more exactly determined than the position of the mode. Here about 1.7 times as accurately.

(ii.) The modal frequency, on the other hand, is sensibly more accurate than the mean frequency. Here about 2.8 times as accurate.

Hence the advantage of using the mean as origin of measurement for the curve is 
accompanied by the counterbalancing, and here relatively greater, disadvantage of 
the increased inaccuracy of determination of the mean frequency. 

Passing to the "physical" constants of the curve, we have

Constant.    Probable Error.    Percentage Probable Error.

$\sigma$     0.012693           0.6291
Sk.          0.022845           1.3445
$d$          0.016663           1.7854

These fully determine the non-symmetrical nature and spread of the curve, and since the errors in the skewness and in the distance between the mean and mode are less than 1.4 and 1.8 per cent. of the respective values of these quantities, we conclude that skewness and divergence between mode and mean are characteristic features of enteric fever distribution, and not mere anomalies due to a random selection of cases. They are significant constants peculiar to each type of fever distribution and no description of such a distribution is sufficient unless their values are stated.

Before giving a table of the correlations between what we have termed the "physical" constants, it may be well to write down some of the correlations between the errors in the physical and algebraical constants, which arise in the course of their calculation. We find






R^,^= -9581, 
R,,, = -1875, 



R 



'2yni 



0, 



SAlJy • "~" 1. 



By aid of these we find the following table of error correlations:

            Mean.      $y_0$.     $\sigma$.   $d$.       Sk.

Mean.        1          0.6469    −0.5321    −0.1875     0
$y_0$.       0.6469     1         −0.8908    −0.4260    −0.1489
$\sigma$.   −0.5321    −0.8908     1          0.7905     0.1584
$d$.        −0.1875    −0.4260     0.7905     1          0.9592
Sk.          0         −0.1489     0.1584     0.9592     1



Now this table enables us to draw some remarkable conclusions with regard to enteric fever. We see at once that no random selection of a group of individuals, which has any single characteristic differing from that of the general population will, except in the case of mean age of incidence and skewness, leave the other characteristics unmodified. Thus the most probable result of any selection which alters the nature of the distribution of enteric fever can be predicted. The reader will possibly appreciate this better, if we replace the above table by another giving the absolute progressions in years, number of cases per thousand, &c.

Progression Table.

                                                Unit change of

Corresponds to a          One year      One case per    One year in    One year in       A unit of
probable change in        in mean       cent. in        standard       number of years   1/10 in the
the same units of         age of        modal year of   deviation      between modal     skewness.
                          incidence.    frequency.*     or "spread."   and mean
                                                                       incidence.

Mean age of incidence      1             0.0909         −0.6120        −0.1643            0
Modal frequency . . .      4.5852        1              −7.2626        −2.6456           −0.3372
"Spread" . . . . . .      −0.4626       −0.1093          1              0.6022            0.0440
Interval between mode
  and mean . . . . .      −0.2140       −0.0686          1.0377         1                 0.3498
Skewness . . . . . .       0            −0.0657          0.5702         2.6301            1

* The frequency of incidence in the modal year $= y_0 \times \tfrac{1}{5}$, since the unit is five years, $= 1894.57 \times \tfrac{1}{5}$. To make this 1000 we must multiply by $1000/(\tfrac{1}{5}\,y_0)$. Similarly $\Delta y_0 \times \tfrac{1}{5}$ = error in incidence of modal year. Thus we have to replace $\Delta y_0$ by $\frac{y_0}{1000} \times$ (error per thousand in modal year of incidence) $= \frac{1894.57}{1000} \times$ (error per thousand in modal year of incidence).




We see at once from the above table that if the mean age of incidence of enteric fever in any group were raised, the disease would be concentrated in fewer years, the modal and mean incidence would be brought closer together, and the incidence in the modal year of frequency would be heavier. The changes here are very sensible. Thus, if we raised the mean age of attack to that of phthisis, or about nine years, the modal frequency would be increased about 41 per cent., the concentration of the incidence of the fever increased about 40 per cent., while the distance between mode and mean would be reduced to nearly 2/5 of its original value. The skewness would not be changed. Much less marked effects would arise from a selection of modal frequency. Any increase of modal frequency tends to slightly raise the mean age of attack, to increase slightly the concentration, to draw the mode towards the mean and reduce the skewness.
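
The entries of the progression table can be recovered from the error correlations and the probable errors. The sketch below assumes that each entry is a regression coefficient of the form $R \times$ (probable error of the dependent constant) / (probable error of the selected constant), with the modal frequency measured in per cent. of itself; this is our reading, adopted because it reproduces the printed entries, and the signs of the correlations are those implied by the progression table.

```python
# Probable errors quoted above, converted to the units of the progression table.
UNIT_YEARS = 5.0
pe_mean_years = 0.014600 * UNIT_YEARS   # 0.073 year
pe_sigma_years = 0.012693 * UNIT_YEARS
pe_d_years = 0.016663 * UNIT_YEARS
pe_modal_percent = 0.5174               # percentage probable error of y0

# Error correlations with the mean (signs as implied by the progression table).
r_mean_modal, r_mean_sigma, r_mean_d = 0.6469, -0.5321, -0.1875


def progression(r, pe_dependent, pe_selected):
    """Probable change in the dependent constant per unit change of the selected one."""
    return r * pe_dependent / pe_selected


modal_per_year = progression(r_mean_modal, pe_modal_percent, pe_mean_years)
sigma_per_year = progression(r_mean_sigma, pe_sigma_years, pe_mean_years)
d_per_year = progression(r_mean_d, pe_d_years, pe_mean_years)

# Raising the mean age of attack by about nine years.
rise_9_years = 9 * modal_per_year
print(round(modal_per_year, 3), round(sigma_per_year, 4), round(d_per_year, 4), round(rise_9_years, 1))
```

The first column of the table (4.5852, −0.4626, −0.2140) and the rise of about 41 per cent. in modal frequency for a nine years' change follow directly.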

The changes produced by closer concentration of the attacks of the disease, i.e., the 
limitation of its incidence to fewer years, would be of a more marked character, they 
would raise the mean age of attack and the modal frequency, they would decrease the 
interval between mode and mean, and reduce the skewness. Concentration of the 
disease would thus tend to render its distribution more normal. 

To increase the interval between mean and mode lowers the mean age of attack, 
reduces the modal frequency, increases the period of liability to incidence, and much 
increases the skewness. 

Finally, increase of skewness decreases the modal frequency, increases the period of liability and the interval between mean and mode.

These statements with regard to the manner in which enteric fever would affect different groups selected at random from the general population seem of considerable interest, for there is reason to believe that what is thus stated for enteric fever in different groups may be applied to different fevers in one and the same group. For example, the lower the mean age of attack of any fever, the greater its concentration; the less the concentration, the more nearly normal is its distribution, &c., &c.



(19.) Probable Errors and Error Correlations of the Constants of the Generalised Probability Curve of the Type $y = y_0 (1 + x/a_1)^{m_1} (1 - x/a_2)^{m_2}$.

Transfer the origin to one end of the range, and the equation to the curve becomes

$$y = \frac{n}{b}\,\frac{\Gamma(m_1 + m_2 + 2)}{\Gamma(m_1 + 1)\,\Gamma(m_2 + 1)} \left(\frac{x}{b}\right)^{m_1} \left(1 - \frac{x}{b}\right)^{m_2} \qquad \text{(lxxv.)},$$

where $n$ is the number of observations and $b$ is the range.

The following values are given in 'Phil. Trans.,' A, vol. 186, pp. 368-9, where $r = m_1 + m_2 + 2$:

$$\text{Mean } \bar{x} = \frac{b\,(m_1 + 1)}{m_1 + m_2 + 2} \qquad \text{(lxxvi.)},$$

$$\text{Mode } \tilde{x} = \frac{b\,m_1}{m_1 + m_2} \qquad \text{(lxxvii.)},$$

$$d = \bar{x} - \tilde{x} = \frac{b\,(m_2 - m_1)}{(m_1 + m_2)(m_1 + m_2 + 2)} \qquad \text{(lxxviii.)},$$

$$\sigma = \frac{b}{r}\sqrt{\frac{(m_1 + 1)(m_2 + 1)}{r + 1}} \qquad \text{(lxxix.)},$$

$$\text{Skewness} = Sk. = \frac{m_2 - m_1}{m_1 + m_2}\sqrt{\frac{m_1 + m_2 + 3}{(m_1 + 1)(m_2 + 1)}} \qquad \text{(lxxx.)}.$$



For the moments we have

$$\mu_2 = \frac{b^2 (m_1+1)(m_2+1)}{r^2 (r+1)} \qquad \text{(lxxxi.)},$$

$$\mu_3 = \frac{2 b^3 (m_1+1)(m_2+1)(m_2 - m_1)}{r^3 (r+1)(r+2)} \qquad \text{(lxxxii.)},$$

$$\mu_4 = \frac{3 b^4 (m_1+1)(m_2+1)\{(m_1+1)(m_2+1)(r-6) + 2r^2\}}{r^4 (r+1)(r+2)(r+3)} \qquad \text{(lxxxiii.)}.$$

Lastly, for the mean and modal frequencies $y_1\,\delta x$ and $y_0\,\delta x$, we have

$$y_1 = \frac{n}{b}\,\frac{(m_1+m_2+2)^{3/2}}{\sqrt{2\pi (m_1+1)(m_2+1)}}\,\exp\{S(m_1+m_2+2) - S(m_1+1) - S(m_2+1)\} \qquad \text{(lxxxiv.)},$$

$$y_0 = \frac{n}{b}\,\frac{(m_1+m_2+1)\sqrt{m_1+m_2}}{\sqrt{2\pi m_1 m_2}}\,\exp\{S(m_1+m_2) - S(m_1) - S(m_2)\} \qquad \text{(lxxxv.)},$$

where

$$S(p) = \frac{B_1}{1\cdot 2\,p} - \frac{B_3}{3\cdot 4\,p^3} + \frac{B_5}{5\cdot 6\,p^5} - \text{etc.}$$
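
The Stirling-type expression for the modal frequency can be compared with a direct evaluation of the curve at its mode. The sketch below is illustrative only: it assumes $B_1 = 1/6$, $B_3 = 1/30$, $B_5 = 1/42$ (the older numbering of the Bernoulli numbers), keeps three terms of $S(p)$, and uses the swine constants quoted later in this memoir, with $m_1$ taken as $r - m_2 - 2 = 3.783718$ and $n = 1000$ (frequencies per mille).

```python
import math

B1, B3, B5 = 1 / 6, 1 / 30, 1 / 42  # assumed Bernoulli numbers in the memoir's numbering


def S(p):
    """First three terms of the semi-convergent series S(p)."""
    return B1 / (1 * 2 * p) - B3 / (3 * 4 * p**3) + B5 / (5 * 6 * p**5)


def modal_frequency_stirling(n, b, m1, m2):
    """Modal frequency from the Stirling-type formula."""
    s = m1 + m2
    factor = (s + 1) * math.sqrt(s) / math.sqrt(2 * math.pi * m1 * m2)
    return (n / b) * factor * math.exp(S(s) - S(m1) - S(m2))


def modal_frequency_exact(n, b, m1, m2):
    """Ordinate of the curve at the mode x = b m1 / (m1 + m2), via log-gamma."""
    s = m1 + m2
    log_chi = math.lgamma(s + 2) - math.lgamma(m1 + 1) - math.lgamma(m2 + 1)
    return (n / b) * math.exp(log_chi + m1 * math.log(m1 / s) + m2 * math.log(m2 / s))


n, b, m1, m2 = 1000.0, 18.0446, 3.783718, 14.201402
y0_stirling = modal_frequency_stirling(n, b, m1, m2)
y0_exact = modal_frequency_exact(n, b, m1, m2)
print(round(y0_stirling, 3), round(y0_exact, 3))
```

Both routes give a modal frequency close to the 237.263 per mille quoted for the swine data, which suggests that three terms of the series are ample for constants of this size.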



Let

$$I(m_1, m_2) = \int_0^b y_0 \left(\frac{x}{b}\right)^{m_1} \left(1 - \frac{x}{b}\right)^{m_2} dx,$$

where

$$y_0 = \frac{n}{b}\,\frac{\Gamma(m_1 + m_2 + 2)}{\Gamma(m_1 + 1)\,\Gamma(m_2 + 1)} \qquad \text{(lxxxvi.)},$$

then $I(m_1, m_2) = n$, and we easily find, by the fundamental property of $\Gamma$ functions, that

$$I(m_1 - 1, m_2) = \frac{m_1 + m_2 + 1}{m_1}\,n,$$

$$I(m_1, m_2 - 1) = \frac{m_1 + m_2 + 1}{m_2}\,n,$$

$$I(m_1 - 2, m_2) = \frac{(m_1 + m_2)(m_1 + m_2 + 1)}{m_1 (m_1 - 1)}\,n,$$

$$I(m_1, m_2 - 2) = \frac{(m_1 + m_2)(m_1 + m_2 + 1)}{m_2 (m_2 - 1)}\,n.$$



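From (i.) we have

$$\log y = \log n - \log b + \log \chi + m_1 \log\frac{x}{b} + m_2 \log\left(1 - \frac{x}{b}\right) \qquad \text{(lxxxvii.)},$$

where

$$\chi = \frac{\Gamma(m_1 + m_2 + 2)}{\Gamma(m_1 + 1)\,\Gamma(m_2 + 1)} \qquad \text{(lxxxviii.)}.$$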



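It will be needful to find $\dfrac{d^2 \log\chi}{dm_1^2}$, $\dfrac{d^2 \log\chi}{dm_1\,dm_2}$, and $\dfrac{d^2 \log\chi}{dm_2^2}$.

$$\frac{d^2 \log\chi}{dm_1^2} = \frac{d^2 \log\Gamma(m_1 + m_2 + 2)}{d(m_1 + m_2 + 1)^2} - \frac{d^2 \log\Gamma(m_1 + 1)}{dm_1^2} = \epsilon_3 - \epsilon_1 \qquad \text{(lxxxix.)}.$$

Similarly

$$\frac{d^2 \log\chi}{dm_2^2} = \epsilon_3 - \epsilon_2 \qquad \text{(xc.)},$$

$$\frac{d^2 \log\chi}{dm_1\,dm_2} = \epsilon_3 \qquad \text{(xci.)},$$

where

$$\epsilon_1 = \frac{d^2 \log\Gamma(m_1 + 1)}{dm_1^2}, \quad \epsilon_2 = \frac{d^2 \log\Gamma(m_2 + 1)}{dm_2^2}, \quad \epsilon_3 = \frac{d^2 \log\Gamma(m_1 + m_2 + 2)}{d(m_1 + m_2 + 1)^2} \qquad \text{(xcii.)}.$$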



$\epsilon_1$, $\epsilon_2$, and $\epsilon_3$ can now be readily expressed in semi-convergent series admitting of calculation.

$$\epsilon_1 = \frac{2m_1 - 1}{2m_1^2} + \frac{d^2 S(m_1)}{dm_1^2}, \qquad \epsilon_2 = \frac{2m_2 - 1}{2m_2^2} + \frac{d^2 S(m_2)}{dm_2^2},$$

$$\epsilon_3 = \frac{2(m_1 + m_2 + 1) - 1}{2(m_1 + m_2 + 1)^2} + \frac{d^2 S(m_1 + m_2 + 1)}{d(m_1 + m_2 + 1)^2} \qquad \text{(xciii.)},$$

where

$$\frac{d^2 S(p)}{dp^2} = \frac{B_1}{p^3} - \frac{B_3}{p^5} + \frac{B_5}{p^7} - \text{etc.} \qquad \text{(xciv.)}.$$

It is clear that if $m_1$ and $m_2$ be at all large, which they frequently will be, we may omit the series in B, or even reduce $\epsilon_1$, $\epsilon_2$, $\epsilon_3$ to $1/m_1$, $1/m_2$, and $1/(m_1 + m_2 + 1)$, respectively.
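
The series for the $\epsilon$'s are easily evaluated. The sketch below assumes $B_1 = 1/6$, $B_3 = 1/30$, $B_5 = 1/42$ and keeps three terms of the series; the constants are those of the swine illustration given later, with $m_1$ taken as $r - m_2 - 2 = 3.783718$, and the function names are ours.

```python
B1, B3, B5 = 1 / 6, 1 / 30, 1 / 42  # assumed Bernoulli numbers in the memoir's numbering


def d2S(p):
    """Second derivative of S(p), first three terms of the series."""
    return B1 / p**3 - B3 / p**5 + B5 / p**7


def epsilon(p):
    """Second derivative of log Gamma(p + 1) with respect to p, by the series."""
    return (2 * p - 1) / (2 * p * p) + d2S(p)


m1, m2 = 3.783718, 14.201402
eps1 = epsilon(m1)
eps2 = epsilon(m2)
eps3 = epsilon(m1 + m2 + 1)

# The crude approximations mentioned in the text, for comparison.
crude = (1 / m1, 1 / m2, 1 / (m1 + m2 + 1))
print(round(eps1, 7), round(eps2, 7), round(eps3, 7), crude)
```

With these constants the series reproduce the values 0.2324012, 0.0679945 and 0.0513099 quoted in the illustration, whereas the crude approximation $1/m_1$ is appreciably too large when $m_1$ is as small as 3.8.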

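Making use of (lxxxvii.) and (lxxxviii.) we easily find

$$a_{11} = n b_{11} = -\int y\,\frac{d^2(\log y)}{dh^2}\,dx = \frac{n}{b^2}\,(m_1 + m_2)(m_1 + m_2 + 1)\left\{\frac{1}{m_1 - 1} + \frac{1}{m_2 - 1}\right\} \qquad \text{(xcv.)},$$

$$a_{12} = n b_{12} = -\int y\,\frac{d^2(\log y)}{dh\,db}\,dx = \frac{n}{b^2}\,\frac{(m_1 + m_2)(m_1 + m_2 + 1)}{m_2 - 1} \qquad \text{(xcvi.)},$$

$$a_{13} = n b_{13} = -\int y\,\frac{d^2(\log y)}{dh\,dm_1}\,dx = \frac{n}{b}\,\frac{m_1 + m_2 + 1}{m_1} \qquad \text{(xcvii.)},$$

$$a_{14} = n b_{14} = -\int y\,\frac{d^2(\log y)}{dh\,dm_2}\,dx = -\frac{n}{b}\,\frac{m_1 + m_2 + 1}{m_2} \qquad \text{(xcviii.)},$$

$$a_{22} = n b_{22} = -\int y\,\frac{d^2(\log y)}{db^2}\,dx = \frac{n}{b^2}\,\frac{(m_1 + m_2 + 1)(m_1 + 1)}{m_2 - 1} \qquad \text{(xcix.)},$$

$$a_{23} = n b_{23} = -\int y\,\frac{d^2(\log y)}{db\,dm_1}\,dx = \frac{n}{b} \qquad \text{(c.)},$$

$$a_{24} = n b_{24} = -\int y\,\frac{d^2(\log y)}{db\,dm_2}\,dx = -\frac{n}{b}\,\frac{m_1 + 1}{m_2} \qquad \text{(ci.)},$$

$$a_{33} = n b_{33} = -\int y\,\frac{d^2(\log y)}{dm_1^2}\,dx = n\,(\epsilon_1 - \epsilon_3) \qquad \text{(cii.)},$$

$$a_{34} = n b_{34} = -\int y\,\frac{d^2(\log y)}{dm_1\,dm_2}\,dx = -n\,\epsilon_3 \qquad \text{(ciii.)},$$

$$a_{44} = n b_{44} = -\int y\,\frac{d^2(\log y)}{dm_2^2}\,dx = n\,(\epsilon_2 - \epsilon_3) \qquad \text{(civ.)}.$$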



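The next stage is to calculate the determinant

$$\Delta = \begin{vmatrix} b_{11} & b_{12} & b_{13} & b_{14} \\ b_{12} & b_{22} & b_{23} & b_{24} \\ b_{13} & b_{23} & b_{33} & b_{34} \\ b_{14} & b_{24} & b_{34} & b_{44} \end{vmatrix}$$

and the minors $B_{11}$, &c., of $b_{11}$, &c. We shall then determine

$$\Sigma_h^2 = \frac{1}{n}\frac{B_{11}}{\Delta}, \quad \Sigma_b^2 = \frac{1}{n}\frac{B_{22}}{\Delta}, \quad \Sigma_{m_1}^2 = \frac{1}{n}\frac{B_{33}}{\Delta}, \quad \Sigma_{m_2}^2 = \frac{1}{n}\frac{B_{44}}{\Delta} \qquad \text{(cv.)},$$

and the correlations

$$R_{hb} = \frac{B_{12}}{\sqrt{B_{11}B_{22}}}, \quad R_{hm_1} = \frac{B_{13}}{\sqrt{B_{11}B_{33}}}, \quad R_{hm_2} = \frac{B_{14}}{\sqrt{B_{11}B_{44}}},$$

$$R_{bm_1} = \frac{B_{23}}{\sqrt{B_{22}B_{33}}}, \quad R_{bm_2} = \frac{B_{24}}{\sqrt{B_{22}B_{44}}}, \quad R_{m_1 m_2} = \frac{B_{34}}{\sqrt{B_{33}B_{44}}} \qquad \text{(cvi.)}.$$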



The algebraic expressions for the expanded determinant and its minors are very lengthy, and it will be found easiest in any numerical case to calculate the numerical values of the $b_{11}$, $b_{12}$, $b_{13}$, $b_{14}$, $b_{22}$, . . . , and then find the values of the determinant and its minors numerically. So soon as the above four standard deviations and six error correlations have been calculated, the determination of the probable errors in the fundamental constants of the frequency distribution becomes easy.

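We have for the mean organ $M$, if $h$ now defines the origin of coordinates,

$$\Delta M = \Delta h + \frac{m_1 + 1}{m_1 + m_2 + 2}\,\Delta b + \frac{b\,(m_2 + 1)}{(m_1 + m_2 + 2)^2}\,\Delta m_1 - \frac{b\,(m_1 + 1)}{(m_1 + m_2 + 2)^2}\,\Delta m_2 \qquad \text{(cvii.)}.$$

Whence, writing $r = m_1 + m_2 + 2$,

$$\Sigma_M^2 = \Sigma_h^2 + \frac{(m_1+1)^2}{r^2}\Sigma_b^2 + \frac{b^2 (m_2+1)^2}{r^4}\Sigma_{m_1}^2 + \frac{b^2 (m_1+1)^2}{r^4}\Sigma_{m_2}^2 + \frac{2(m_1+1)}{r}\Sigma_h\Sigma_b R_{hb} + \frac{2b\,(m_2+1)}{r^2}\Sigma_h\Sigma_{m_1}R_{hm_1} - \frac{2b\,(m_1+1)}{r^2}\Sigma_h\Sigma_{m_2}R_{hm_2}$$

$$\qquad + \frac{2b\,(m_1+1)(m_2+1)}{r^3}\Sigma_b\Sigma_{m_1}R_{bm_1} - \frac{2b\,(m_1+1)^2}{r^3}\Sigma_b\Sigma_{m_2}R_{bm_2} - \frac{2b^2 (m_1+1)(m_2+1)}{r^4}\Sigma_{m_1}\Sigma_{m_2}R_{m_1 m_2} \qquad \text{(cviii.)}.$$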



Similarly, the modal value of the organ can be found from*

$$\Delta M_0 = \Delta h + \frac{m_1}{m_1 + m_2}\,\Delta b + \frac{b\,m_2}{(m_1 + m_2)^2}\,\Delta m_1 - \frac{b\,m_1}{(m_1 + m_2)^2}\,\Delta m_2 \qquad \text{(cix.)}.$$



* The easiest numerical method in this, as in the previous case, is to proceed as follows: Write

$$\Sigma_{M_0}\chi_{M_0} = \Sigma_h\chi_h + \frac{m_1}{m_1 + m_2}\,\Sigma_b\chi_b + \frac{b\,m_2}{(m_1 + m_2)^2}\,\Sigma_{m_1}\chi_{m_1} - \frac{b\,m_1}{(m_1 + m_2)^2}\,\Sigma_{m_2}\chi_{m_2},$$

where the $\chi$ are umbral symbols, and let $N_1$, $N_2$, $N_3$, $N_4$ be the numerical values of the coefficients on the right. Then put

$$\Sigma_{M_0}\chi_{M_0} = N_1\chi_h + N_2\chi_b + N_3\chi_{m_1} - N_4\chi_{m_2}.$$

Now square this equation, and, whenever a product, $\chi_q\chi_{q'}$, occurs, multiply it by the corresponding error correlation, $R_{qq'}$, already calculated, putting $R_{qq'} = 1$ if $q = q'$. Then, actually,

$$\Sigma_{M_0}^2 = N_1^2 + N_2^2 + N_3^2 + N_4^2 + 2N_1N_2R_{hb} + 2N_1N_3R_{hm_1} - 2N_1N_4R_{hm_2} + 2N_2N_3R_{bm_1} - 2N_2N_4R_{bm_2} - 2N_3N_4R_{m_1 m_2}.$$

Barlow's Tables rapidly give the squares and Crelle's Tables, or a Brunsviga, the products.
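
In modern terms the umbral rule of this footnote amounts to evaluating a quadratic form in the error correlations. A minimal sketch, with a function name and argument layout of our own choosing, is:

```python
import math


def sd_of_linear_form(coefficients, standard_deviations, correlations):
    """Standard deviation of sum_i c_i * delta_i for correlated errors delta_i.

    Each N_i = c_i * Sigma_i plays the part of the numerical coefficient of an
    umbral symbol; squaring and replacing products of umbral symbols by the
    error correlations gives sum_i sum_j N_i N_j R_ij, with R_ii = 1.
    """
    n_terms = [c * s for c, s in zip(coefficients, standard_deviations)]
    total = 0.0
    for i, n_i in enumerate(n_terms):
        for j, n_j in enumerate(n_terms):
            r_ij = 1.0 if i == j else correlations[i][j]
            total += n_i * n_j * r_ij
    return math.sqrt(total)


# Two uncorrelated errors with standard deviations 3 and 4: the sum has 5.
uncorrelated = sd_of_linear_form([1, 1], [3.0, 4.0], [[1, 0], [0, 1]])
# Perfectly correlated errors entering with opposite signs largely cancel.
cancelling = sd_of_linear_form([1, -1], [3.0, 4.0], [[1, 1], [1, 1]])
print(uncorrelated, cancelling)
```

A negative coefficient, such as that of $\chi_{m_2}$ above, is carried simply by the sign of the corresponding entry of `coefficients`.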






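In like manner from (lxxviii.)

$$\frac{\Delta d}{d} = \frac{\Delta b}{b} + \Delta m_1\left\{\frac{1}{m_1 - m_2} - \frac{1}{m_1 + m_2} - \frac{1}{m_1 + m_2 + 2}\right\} + \Delta m_2\left\{-\frac{1}{m_1 - m_2} - \frac{1}{m_1 + m_2} - \frac{1}{m_1 + m_2 + 2}\right\}$$

$$\qquad = \frac{\Delta b}{b} + c_1\,\Delta m_1 + c_2\,\Delta m_2,$$

say, where the numerical values of $c_1$ and $c_2$ can easily be found in any actual case.

$$\frac{\Sigma_d^2}{d^2} = \frac{\Sigma_b^2}{b^2} + c_1^2\Sigma_{m_1}^2 + c_2^2\Sigma_{m_2}^2 + 2c_1c_2\Sigma_{m_1}\Sigma_{m_2}R_{m_1 m_2} + \frac{2c_1}{b}\Sigma_b\Sigma_{m_1}R_{bm_1} + \frac{2c_2}{b}\Sigma_b\Sigma_{m_2}R_{bm_2} \qquad \text{(cx.)}.$$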



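Again, from (lxxix.)

$$\frac{\Delta\sigma}{\sigma} = \frac{\Delta b}{b} + \Delta m_1\left\{-\frac{1}{r} - \frac{1}{2(r + 1)} + \frac{1}{2(m_1 + 1)}\right\} + \Delta m_2\left\{-\frac{1}{r} - \frac{1}{2(r + 1)} + \frac{1}{2(m_2 + 1)}\right\}.$$

Or $\Delta\sigma/\sigma = \Delta b/b + e_1\,\Delta m_1 + e_2\,\Delta m_2$,

$$\frac{\Sigma_\sigma^2}{\sigma^2} = \frac{\Sigma_b^2}{b^2} + e_1^2\Sigma_{m_1}^2 + e_2^2\Sigma_{m_2}^2 + 2e_1e_2\Sigma_{m_1}\Sigma_{m_2}R_{m_1 m_2} + \frac{2e_1}{b}\Sigma_b\Sigma_{m_1}R_{bm_1} + \frac{2e_2}{b}\Sigma_b\Sigma_{m_2}R_{bm_2} \qquad \text{(cxi.)}.$$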



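Further from (lxxx.)

$$\frac{\Delta Sk.}{Sk.} = \Delta m_1\left\{\frac{1}{m_1 - m_2} - \frac{1}{m_1 + m_2} + \frac{1}{2(m_1 + m_2 + 3)} - \frac{1}{2(m_1 + 1)}\right\} + \Delta m_2\left\{-\frac{1}{m_1 - m_2} - \frac{1}{m_1 + m_2} + \frac{1}{2(m_1 + m_2 + 3)} - \frac{1}{2(m_2 + 1)}\right\}$$

$$\qquad = f_1\,\Delta m_1 + f_2\,\Delta m_2, \text{ say.}$$

Hence

$$\Sigma_{Sk.}^2 = Sk.^2\left\{f_1^2\Sigma_{m_1}^2 + f_2^2\Sigma_{m_2}^2 + 2f_1f_2\Sigma_{m_1}\Sigma_{m_2}R_{m_1 m_2}\right\} \qquad \text{(cxii.)}.$$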



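From the results given above we can deduce the effects on size, range, variability, or skewness of a selection at random of any one of these four.
Writing (cvii.)

$$\Delta M = \Delta h + g_1\,\Delta b + g_2\,\Delta m_1 - g_3\,\Delta m_2,$$

we find

$$R_{M\sigma}\frac{\Sigma_M\Sigma_\sigma}{\sigma} = \frac{1}{b}\left\{\Sigma_h\Sigma_b R_{hb} + g_1\Sigma_b^2 + g_2\Sigma_b\Sigma_{m_1}R_{bm_1} - g_3\Sigma_b\Sigma_{m_2}R_{bm_2}\right\} + e_1\Sigma_h\Sigma_{m_1}R_{hm_1} + e_1g_1\Sigma_b\Sigma_{m_1}R_{bm_1} + e_1g_2\Sigma_{m_1}^2 - e_1g_3\Sigma_{m_1}\Sigma_{m_2}R_{m_1 m_2}$$

$$\qquad + e_2\Sigma_h\Sigma_{m_2}R_{hm_2} + e_2g_1\Sigma_b\Sigma_{m_2}R_{bm_2} + e_2g_2\Sigma_{m_1}\Sigma_{m_2}R_{m_1 m_2} - e_2g_3\Sigma_{m_2}^2 \qquad \text{(cxiii.)},$$

$$R_{Mb}\Sigma_M\Sigma_b = \Sigma_h\Sigma_b R_{hb} + g_1\Sigma_b^2 + g_2\Sigma_{m_1}\Sigma_b R_{m_1 b} - g_3\Sigma_{m_2}\Sigma_b R_{m_2 b} \qquad \text{(cxiv.)},$$

$$R_{M\,Sk.}\frac{\Sigma_M\Sigma_{Sk.}}{Sk.} = f_1\Sigma_h\Sigma_{m_1}R_{hm_1} + f_1g_1\Sigma_b\Sigma_{m_1}R_{bm_1} + f_1g_2\Sigma_{m_1}^2 - f_1g_3\Sigma_{m_1}\Sigma_{m_2}R_{m_1 m_2}$$

$$\qquad + f_2\Sigma_h\Sigma_{m_2}R_{hm_2} + f_2g_1\Sigma_b\Sigma_{m_2}R_{bm_2} + f_2g_2\Sigma_{m_1}\Sigma_{m_2}R_{m_1 m_2} - f_2g_3\Sigma_{m_2}^2 \qquad \text{(cxv.)}.$$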



These results show us that it is, in the general case of skew variation, impossible to select any one of the quantities (mean size, range, variability, or skewness of an organ) without at the same time in all probability modifying all the others.

For example, the frequency of the incidence of certain types of diseases at different ages follows a distribution of this character. Hence, if any special class of the community had a mean age of incidence differing from that of the general population, we should expect correlated changes in such other characteristics of the disease as (i.) its first appearance; (ii.) its last appearance; (iii.) its tendency to heavier incidence above or below the mean age of incidence; (iv.) the concentration of its incidence about the mean age of incidence for this selected class.

Precisely similar series of changes would arise in the case of a random selection of individuals having the variability of a certain organ greater or less than that of the general population; there would be correlated changes in the size, range, and skewness of the distribution of this organ.

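Turning to the mean and modal frequencies, we have

$$\frac{\Delta y_1}{y_1} = -\frac{\Delta b}{b} + \left\{\frac{3}{2}\,\frac{1}{m_1 + m_2 + 2} - \frac{1}{2}\,\frac{1}{m_1 + 1} + \frac{dS(m_1 + m_2 + 2)}{d(m_1 + m_2 + 2)} - \frac{dS(m_1 + 1)}{d(m_1 + 1)}\right\}\Delta m_1$$

$$\qquad + \left\{\frac{3}{2}\,\frac{1}{m_1 + m_2 + 2} - \frac{1}{2}\,\frac{1}{m_2 + 1} + \frac{dS(m_1 + m_2 + 2)}{d(m_1 + m_2 + 2)} - \frac{dS(m_2 + 1)}{d(m_2 + 1)}\right\}\Delta m_2$$

$$\qquad = -\frac{\Delta b}{b} + h_1\,\Delta m_1 + h_2\,\Delta m_2, \text{ say} \qquad \text{(cxvi.)}.$$

Similarly,

$$\frac{\Delta y_0}{y_0} = -\frac{\Delta b}{b} + \left\{\frac{1}{m_1 + m_2 + 1} + \frac{1}{2(m_1 + m_2)} - \frac{1}{2m_1} + \frac{dS(m_1 + m_2)}{d(m_1 + m_2)} - \frac{dS(m_1)}{dm_1}\right\}\Delta m_1$$

$$\qquad + \left\{\frac{1}{m_1 + m_2 + 1} + \frac{1}{2(m_1 + m_2)} - \frac{1}{2m_2} + \frac{dS(m_1 + m_2)}{d(m_1 + m_2)} - \frac{dS(m_2)}{dm_2}\right\}\Delta m_2$$

$$\qquad = -\frac{\Delta b}{b} + h_3\,\Delta m_1 + h_4\,\Delta m_2, \text{ say} \qquad \text{(cxvii.)}.$$

Here $h_1$, $h_2$, $h_3$, $h_4$ can be easily calculated, if we note that

$$\frac{dS(p)}{dp} = -\frac{B_1}{2p^2} + \frac{B_3}{4p^4} - \frac{B_5}{6p^6} + \ldots = -T/p, \text{ where } T \text{ is the same as on p. 272} \qquad \text{(cxviii.)}.$$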



$\Sigma_{y_1}$ and $\Sigma_{y_0}$ can be found in the usual manner by squaring after the insertion of the numerical values.

From (lxxxi.)-(lxxxiii.) the probable errors and correlations of the moments can be found if required, the calculation being numerically somewhat laborious, but presenting nothing of novelty.

The probable error of the criterion

$$\kappa = 3\beta_1 - 2\beta_2 + 6,$$

where $\beta_1 = \mu_3^2/\mu_2^3$ and $\beta_2 = \mu_4/\mu_2^2$, may be found as follows: Put $e = (m_1 + 1)(m_2 + 1)$, $r = m_1 + m_2 + 2$; then we find

$$\kappa = \frac{12\,r^2\,(r + 1 + e)}{(r + 2)^2 (r + 3)\,e} \qquad \text{(cxix.)},$$

and accordingly

$$\frac{\Delta\kappa}{\kappa} = \left(\frac{2}{r} - \frac{2}{r + 2} - \frac{1}{r + 3} + \frac{1}{r + 1 + e}\right)\Delta r + \left(\frac{1}{r + 1 + e} - \frac{1}{e}\right)\Delta e$$

$$\qquad = \left(\frac{2}{r} - \frac{2}{r + 2} - \frac{1}{r + 3} + \frac{1}{r + 1 + e} - \frac{r + 1}{(r + 1 + e)(m_1 + 1)}\right)\Delta m_1 + \left(\frac{2}{r} - \frac{2}{r + 2} - \frac{1}{r + 3} + \frac{1}{r + 1 + e} - \frac{r + 1}{(r + 1 + e)(m_2 + 1)}\right)\Delta m_2$$

$$\qquad = l_1\,\Delta m_1 + l_2\,\Delta m_2 \qquad \text{(cxx.)},$$

where $l_1$ and $l_2$ admit of easy calculation. Hence

$$\frac{\Sigma_\kappa^2}{\kappa^2} = l_1^2\Sigma_{m_1}^2 + l_2^2\Sigma_{m_2}^2 + 2l_1l_2\Sigma_{m_1}\Sigma_{m_2}R_{m_1 m_2} \qquad \text{(cxxi.)}.$$

The value of $\Sigma_\kappa$ can thus be found, and the steadiness of the curve to its type ascertained.
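
The closed form for the criterion can be checked against its definition. The sketch below builds $\beta_1$ and $\beta_2$ from the moment formulae (lxxxi.)-(lxxxiii.), using the identity $(m_2 - m_1)^2 = r^2 - 4e$, and compares $3\beta_1 - 2\beta_2 + 6$ with $12r^2(r + 1 + e)/\{e(r + 2)^2(r + 3)\}$; the numerical values of $r$ and $e$ are those quoted in the swine illustration, and the function names are ours.

```python
def criterion_closed_form(r, e):
    """Criterion in closed form, with r = m1 + m2 + 2 and e = (m1 + 1)(m2 + 1)."""
    return 12 * r * r * (r + 1 + e) / (e * (r + 2) ** 2 * (r + 3))


def criterion_from_betas(r, e):
    """Criterion 3*beta1 - 2*beta2 + 6 built from the moments of the curve."""
    beta1 = 4 * (r * r - 4 * e) * (r + 1) / (e * (r + 2) ** 2)   # mu3^2 / mu2^3
    beta2 = 3 * (r + 1) * (e * (r - 6) + 2 * r * r) / (e * (r + 2) * (r + 3))  # mu4 / mu2^2
    return 3 * beta1 - 2 * beta2 + 6


r, e = 19.985119, 72.71918
kappa_closed = criterion_closed_form(r, e)
kappa_betas = criterion_from_betas(r, e)
print(round(kappa_closed, 6), round(kappa_betas, 6))
```

Both expressions give a criterion of about 0.5559 for these constants, in agreement with the value found for the swine data.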

Illustration: Glands of the Fore-legs of Swine.

In the 'Proceedings of the American Academy of Arts and Sciences,' vol. 32, p. 87, 1896, is a memoir by C. B. Davenport and C. Bullard, on the variation in number of the Müllerian glands in the fore-legs of 4,000 swine. The paper especially attracted our attention, because the authors are content to describe the frequency distribution of these glands by means of a normal curve. They write, after discussing the plotting of the normal curve on their diagram (pp. 90-91):

"These and other characters of the 'probability' curve are indicated in that shown in dotted line* in the accompanying diagram. The diagram also shows the curve of

* The authors actually represent the normal curve by an 18-sided polygon.



distribution of the various numbers of glands occurring on a leg from 1 to 10.* This curve is drawn from the right female leg only; the curve for the other legs would be very similar. We shall speak in a moment of the method of construction of these curves; but we want now to call attention to the fairly close similarity of the two curves, that gained by observation and the theoretical one, a similarity so close that we are justified in concluding that the law of distribution of the variants in the leg glands of swine is the same as that of accidental errors."







[Figure: observed and theoretical frequency curves; horizontal axis, No. of Glands.]



Now, in our opinion, the curve was markedly skew, and it seemed to us that most interesting properties bearing on the action of selection on the Müllerian glands in swine actually depended on this skewness. We have taken the distribution of glands for 2,000 ♀ swine.

To illustrate the difficulty of applying the normal curve we may remark that it gives about 6 swine per mille with −1 gland, and about 1.5 with −2 glands, while


* The authors have forgotten that there is a sensible percentage of zero-glands.






it gives 30 per mille instead of 10 per mille with no glands. These difficulties are entirely met by the skew curve, which gives no frequency whatever of negative glands (see figure).

Taking the number of glands in the right fore-leg of female swine, we have the 
frequency series : 



No. of glands .     0      1      2      3      4      5      6     7     8     9    10

Frequency . . .    15    209    365    482    414    277    134    72    22     8     2
Per mille . . .     7.5  104.5  182.5  241    207    138.5   67    36    11     4     1
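
The mean and the moments about the mean follow directly from this frequency series. A short sketch (plain moments, with no correction for grouping, which is what the printed values appear to require) is:

```python
# Frequencies of swine with 0, 1, ..., 10 glands (right fore-leg, females).
frequency = [15, 209, 365, 482, 414, 277, 134, 72, 22, 8, 2]
total = sum(frequency)

mean = sum(glands * f for glands, f in enumerate(frequency)) / total


def central_moment(k):
    """k-th moment of the number of glands about the mean."""
    return sum(f * (glands - mean) ** k for glands, f in enumerate(frequency)) / total


mu2, mu3, mu4 = central_moment(2), central_moment(3), central_moment(4)
sigma = mu2 ** 0.5
beta1 = mu3 ** 2 / mu2 ** 3
beta2 = mu4 / mu2 ** 2
criterion = 6 + 3 * beta1 - 2 * beta2

print(total, mean, round(mu2, 6), round(mu3, 6), round(mu4, 5), round(criterion, 4))
```

The total is 2,000 and the mean 3.501 glands; the criterion comes out positive, which is what marks the distribution as one of limited range.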



We have worked with the frequency per mille for convenience of reduction, 
although the actual number of observed cases, 2,000, is used, of course, in the 
determination of the probable errors. 

Using the method of the paper in the 'Phil. Trans.,' A, vol. 186, p. 367, we found

mean = 3.501 glands.           $\sigma$ = 1.680774
$\mu_2$ = 2.824999             $\beta_1$ = 0.2591825
$\mu_3$ = 2.417278             $\beta_2$ = 3.1108211
$\mu_4$ = 24.826297            $6 + 3\beta_1 - 2\beta_2$ = 0.555905


Thus the criterion is greater than zero, or the frequency distribution is of Type I., or has a limited range.
Proceeding we found

$r$ = 19.985119            $e$ = 72.71918
$m_1$ = 3.783718           $m_2$ = 14.201402
$a_1$ = 3.79623            $a_2$ = 14.24837
$b$ = 18.0446              $y_0$ = 237.263
$d$ = 0.522996             Sk. = 0.311164
Mode = 2.978               Start of curve = −0.818 gland.
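
These "physical" constants can be recovered from $m_1$, $m_2$ and $b$ by the formulae (lxxvi.)-(lxxx.). The sketch below takes $m_1$ as $r - m_2 - 2 = 3.783718$, the value consistent with the quoted $r$ and $e$, and measures positions from the start of the curve at −0.818 gland; the variable names are ours.

```python
import math

m1, m2, b = 3.783718, 14.201402, 18.0446
start = -0.818  # start of the curve, in glands

r = m1 + m2 + 2
e = (m1 + 1) * (m2 + 1)
mean = start + b * (m1 + 1) / r                 # position of the mean
mode = start + b * m1 / (m1 + m2)               # position of the mode
d = b * (m2 - m1) / ((m1 + m2) * r)             # distance from mode to mean
sigma = (b / r) * math.sqrt(e / (r + 1))        # standard deviation
skewness = d / sigma
a1 = b * m1 / (m1 + m2)                         # distance from start of range to mode
a2 = b - a1                                     # distance from mode to end of range

print(round(mean, 3), round(mode, 3), round(d, 6), round(sigma, 5), round(skewness, 5))
```

The computed mean, mode, $d$, $\sigma$, skewness, $a_1$ and $a_2$ all agree with the printed values to within rounding, and the range is seen to end at about 17.227 glands.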



Thus it would appear that both the distance ($d$) from mean to mode and the skewness are very sensible, and that, unless their probable errors be very large, it is quite impossible to represent the results by a normal curve.

We may note that the range starts from −0.818 gland and runs to 17.227 glands, so since it gives zero at −1 gland, we see that it sensibly confines the possible number of glands between 0 and 17, but we should have to examine considerably more than 2,000 swine to have a probability of more than 10 glands occurring. The total range given is thus both in magnitude and position extremely satisfactory, and supposing only the frequency, not the actually measured quantity, i.e., number of glands, to be known, the theory would have given a very accurate determination of the limits of possibility, especially the start of possibility with the whole number of glands.

In order to work out easily the determinant, $\Delta$, and its minors, we found it desirable to bring out certain factors and reduce the formulae given above to slightly different forms, which, as they are likely to be of general service, are here repeated.

Let $\alpha = m_1\epsilon_1 - (m_1 + m_2)\,\epsilon_3$, $\beta = m_2\epsilon_2 - (m_1 + m_2)\,\epsilon_3$, where $\epsilon_1$, $\epsilon_2$, $\epsilon_3$ have the values given on p. 284, then we found



^/^^^■(??^H-^^^■2 + l) 



hhn^m^ {m-i — 1) (m^ — 1) 



— (mi — 1 ), 

M2(mi — 1), 
^2 + 1, 



0. 



1 



-.-, 



0, 



0, 
— (^^i + ^%~*"2) 

nil -^ 2m2 + 1 

2 ('?7ii + ^^2 + 1 ) 



Ah 



71'' 



'hhn\m\{m.y-— 1) 



mia + mg^j 
— (ms— 1), 



— mise, 
Tdi (m^ *^ 1 ), 



1 



— (^1 + 1) 

(m, + 1) (w, + ^2 + 1) 



ixo') 



72.^(^1 4- '^^ +1) 



^^1 + ^'^2. 

7722 (mi — 1), 

^2 4" I5 



1 



??^2 (% ^^ ^\\)'. 



0, 

mj — 2m2 + 1 
2 (mi + ^2 + 1 ) 



A.<, =1:: 



^^^(mj -f m2 4-1) 
Vm{{mi — l)(m.2 — 1) 



79^1 + mo, 
(mo + 1) (mi 

ry?/2 + I5 



^)> 



], 



«i? ( e 



0. 





m-i — 27^1 + 1 
2 (17^, + 7n.2 + 1 ) 






44 



IV' {711^ + m^ + 1) 



VhnlmKoTii — l)(«i3 — 1) 



0, 

nil + mo + 1, 



2). 0, 

mi^ + ^oA 
mo^, 



(mi — l)(m3--l) 



mi a 



m^m'o€o, 



x\.j2 **'*" 



IV' {mi + mg 4- 1) 
lrm{}nl{m^^ — 1) 



0, 



(mi + 7?7o), 



miOUoe.,, 







9no^, -— 1 

(^?ii + 1) (^^% "^ 1)5 ^^^2 + 2mi + 1 



A -^^ ^^"(m^ 4 - m.^ 4 1 ) 
^'^ Wvi^m.jXm.;, — 1) 



0, 
] 

(nil + ^%). 



fx. 



7/i2^;-',5 



(m/o — 1), 







(mj + 1) 






A 



]4 






24 



A 



34 



n-'{m^ -\- m^ + 1) 






0, 




«?,]« + }?j 


s, 


— oriiOL 


V^m\oni (m^ — 1) 




» — 


^2 1, 

(mi + ^^2)? 




- I, 




m^moeo^ 

— - mi 


n^ (mj 4- ^^^2 4-1) 




mi + ^>^ 


2? 


- 1, 




0, 

mi — 27^2 + 1 


VniiTti^). (^^^'1 ""• 1) ("^^h •— 


1) 






m., + 1 


5 


0, 




2 (mi + m2 + 1 ) 1 


7)? (m| + m^ +1) 


m.i + ^2, 


— (m 


, + 1 ) (m, — 


n 


— {Mi - 1 ) 


¥m{)nl {m^ — 1) {m^^ — 1) 




1, 


i 


— rthm^^z, 




ono/3 






0, 


m.. 


— 2mi + 1, 




~-(7ni + ??i2"-2) 


lfm\m^2. O^h "" 1) ('^^ 


1) 


1> 


mi + mo, 

-1, 




— (mi — 1). 

m,«. 


) 


mim263 


?-2 — 








0, 


— 


(mi + ^'^2 ■" 


2), 


mi — " 2m2 +1) 



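In our particular case we found

$\epsilon_1$ = 0.2324012, $\epsilon_2$ = 0.0679945, $\epsilon_3$ = 0.0513099,

$m_1\alpha$ = −0.1644934, $m_2\beta$ = 0.6078513.

With the aid of the values for $m_1$ and $m_2$ given above, the determinantal parts of the $\Delta$'s were then calculated. If these be $\delta$, $\alpha_{11}$, $\alpha_{22}$, $\alpha_{33}$, $\alpha_{44}$, $\alpha_{12}$, $\alpha_{13}$, $\alpha_{14}$, $\alpha_{23}$, $\alpha_{24}$, $\alpha_{34}$, we have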



S 



a 



11 



a 



22 



OCoo 



a 



u 



•153,7969, 

•348,3713, 

13-018,7332, 

47-671,9443, 

13-357,1309, 



ai2 


— — 


-274,9969, 


^13 




-111,8280, 


^14 





•211,9650, 


•^23 





22-706,5156, 


OC24. 





11-507,0153, 


^34 


__— 


25-088,6121. 



From these the standard deviations and correlations of errors in the alo'ebraical 



constants are at once found. We have 



t. 




-2325, 


-Stoi 


— 


•7784. 


%,l. 


_»- 


5^5908, 


t, 


rr: 


3-7602, 



l-^/«)2l 


•9387, 


4^Am3 


7548, 


1^7/6 


•7143, 


■'-^Ej'ing 


•91145. 


^'\n.ib 


•8726,' 


IVo6 ^^^ 


-9942, 






Then as a step towards the determination of other probable errors, the standard
deviation and umbral equation* for $\gamma = m_1 + m_2$ were found.

This led to

$\Sigma_\gamma = 6.3085$,

$\chi_\gamma = \text{Antl. } \bar{1}.0912932\,\chi_{m_1} + \text{Antl. } \bar{1}.9475528\,\chi_{m_2}$,

$R_{\gamma m_1} = 0.9312$, $R_{\gamma b} = 0.9888$, $R_{\gamma h} = 0.7848$.
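The umbral rule used here (each $\chi_u^2 = 1$ and each product $\chi_u\chi_v = R_{uv}$) can be checked numerically. The following is a minimal sketch, not the authors' procedure: it assumes the four-figure standard deviations and correlations quoted above, and the labels `h`, `b`, `m1`, `m2` are hypothetical names for start of range, range, $m_1$ and $m_2$.

```python
import math

# Four-figure values quoted in the text (swine example); labels are assumptions.
SD = {"m1": 0.7784, "m2": 5.5908}
R = {("m1", "m2"): 0.91145, ("m1", "b"): 0.8726, ("m2", "b"): 0.9942,
     ("h", "m1"): 0.9387, ("h", "m2"): 0.7548}

def corr(u, v):
    """Umbral product chi_u * chi_v: 1 when u == v, else the error correlation."""
    if u == v:
        return 1.0
    return R[(u, v)] if (u, v) in R else R[(v, u)]

def umbral_product(p, q):
    """Multiply two umbral expressions given as {constant: coefficient}."""
    return sum(cp * cq * corr(u, v) for u, cp in p.items() for v, cq in q.items())

# gamma = m1 + m2, so Sigma_gamma*chi_gamma = Sigma_m1*chi_m1 + Sigma_m2*chi_m2.
raw = {"m1": SD["m1"], "m2": SD["m2"]}
sd_gamma = math.sqrt(umbral_product(raw, raw))

# Dividing by Sigma_gamma gives the pure umbral equation for chi_gamma.
pure = {k: c / sd_gamma for k, c in raw.items()}

# Correlations of gamma with the other constants.
r_gamma = {v: umbral_product(pure, {v: 1.0}) for v in ("m1", "b", "h")}

print(round(sd_gamma, 4), {k: round(x, 4) for k, x in r_gamma.items()})
```

Because the inputs are rounded to four figures, the results agree with the printed values only to about three decimals.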



By aid of these auxiliary results the probable errors of all the algebraical and
"physical" constants were determined.

Probable Error Table.

Constant.                     Probable error.    Percentage probable error.

Algebraic constants:
  m_1                             0.5250               13.8762
  m_2                             3.7709               26.5533
  a_1                             0.1748                4.6056
  a_2                             2.4351               17.0903

"Physical" constants:
  Range                           2.5362               14.0554
  Start of range                  0.1568
  Mode                            0.0398
  Mean                            0.0253
  Standard deviation              0.0183                1.0911
  Mean to mode                    0.0294                5.6308
  Skewness                        0.0158                5.0655
  Modal frequency                 3.2455                1.3679
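The entries of this table are related to the standard deviations found earlier by the usual factor 0.67449 (the factor the memoir itself uses in its closing note). A short sketch of that conversion follows; the function names are illustrative, and the value 18.045 for the range is an assumption obtained from the start (-0.818) and end (17.227) of the range quoted in the discussion.

```python
PROBABLE_ERROR_FACTOR = 0.67449  # probable error = 0.67449 x standard deviation

def probable_error(sd):
    """Probable error corresponding to a standard deviation of error."""
    return PROBABLE_ERROR_FACTOR * sd

def percentage_probable_error(pe, value):
    """Probable error expressed as a percentage of the constant itself."""
    return 100.0 * pe / value

pe_range = probable_error(3.7602)   # Sigma_b quoted in the text
pe_m1 = probable_error(0.7784)      # Sigma_m1
pe_m2 = probable_error(5.5908)      # Sigma_m2
pct_range = percentage_probable_error(2.5362, 17.227 + 0.818)

print(round(pe_range, 4), round(pe_m1, 4), round(pe_m2, 4), round(pct_range, 3))
```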



Now it will be clear from an examination of these results that all the "physical"
constants are determined with great accuracy.† The mean is subject to less probable
error than the mode, the modal frequency has a slightly less probable error than the
mean, and as it is less than 1.4 per cent. in the former case, both are closely
known. The skewness and distance from mean to mode are known respectively
with less than 5.1 and with 5.6 per cent. probable errors. Thus they are both significant
constants. In other words, the curve differs significantly from a normal curve, and
it is erroneous to represent the frequency by such a normal curve. The range, which
ought to be such that there is no frequency at -1 gland, gives no frequency at



* See footnote, p. 286, and later, p. 305. It may be as well to remind the reader that here, as in the
other illustrations, logarithms of the full, not the cited values, were used in the calculations.

† The probable percentage errors of $m_1$, $m_2$, $a_1$, $a_2$ are high, but this, as we have several times pointed
out, is of small importance, as, owing to their high correlation, the actual shape of the curve is not
changed sensibly by large changes in $m_1$ and $m_2$.




-0.818 gland with a probable error of ±0.157. It is, therefore, clear that our
method gives the start of the range with very considerable accuracy. The whole
length of the range runs to 17.227 glands, with a probable error of 2.536. We may,
accordingly, conclude that the maximum possible number of glands is hardly likely
to be less than 16 or more than 20. We consider that this example is a good
illustration of the accuracy with which the principal "physical" characteristics of a
distribution may be obtained by aid of skew curves, and how they provide much
information which is not given by the use of the normal curve.

The next point is the determination of the umbral equations giving the error
correlations of the "physical" constants. They are, if Antl. stands for antilogarithm:



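$\chi_{\text{mean}} = \text{Antl. } 0.7924156\,\chi_h - \text{Antl. } 1.3802040\,\chi_b - \text{Antl. } 1.1539620\,\chi_{m_1} + \text{Antl. } 1.5081033\,\chi_{m_2}$,

$\chi_{\text{range}} = \chi_b$,

$\chi_{y_0} = -\text{Antl. } 1.0117885\,\chi_b - \text{Antl. } 0.2482856\,\chi_{m_1} + \text{Antl. } 1.0976534\,\chi_{m_2}$,

$\chi_\sigma = \text{Antl. } 1.1099660\,\chi_b + \text{Antl. } 0.1688507\,\chi_{m_1} - \text{Antl. } 1.1510582\,\chi_{m_2}$,

$\chi_d = \text{Antl. } 0.3972701\,\chi_b - \text{Antl. } 0.2741702\,\chi_{m_1} - \text{Antl. } \bar{1}.8103180\,\chi_{m_2}$,

$\chi_{Sk} = -\text{Antl. } 0.3815919\,\chi_{m_1} + \text{Antl. } 0.3677012\,\chi_{m_2}$.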



Multiplying these out pair and pair, we found

Error Correlation Table.

                       Mean.    Range.   Modal       Standard    Mean to   Skewness.
                                         frequency.  deviation.  mode.

Mean . . . . . . . .    1        0.0232    0.3400     -0.3493     0.0500    0.1309
Range  . . . . . . .    0.0232   1         0.6284      0.0906     0.2132    0.2175
Modal frequency  . .    0.3400   0.6284    1          -0.6944    -0.1473   -0.0141
Standard deviation .   -0.3493   0.0906   -0.6944      1          0.5891    0.4394
Mean to mode . . . .    0.0500   0.2132   -0.1473      0.5891     1         0.9847
Skewness . . . . . .    0.1309   0.2175   -0.0141      0.4394     0.9847    1
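Any entry of the correlation table can be recovered by multiplying two pure umbral equations term by term and replacing each product $\chi_u\chi_v$ by $R_{uv}$. The sketch below does this for the pair mean-to-mode and skewness. It assumes the antilogarithm coefficients as read from the equations above (with $\bar{1}.8103180$ taken as $-1 + 0.8103180$) and the four-figure correlations of the algebraic constants; the labels are hypothetical.

```python
# Error correlations of the algebraic constants (four-figure values from the text).
R = {("b", "m1"): 0.8726, ("b", "m2"): 0.9942, ("m1", "m2"): 0.91145}

def corr(u, v):
    """chi_u * chi_v under the umbral rule."""
    if u == v:
        return 1.0
    return R[tuple(sorted((u, v)))]

def umbral_product(p, q):
    """Product of two umbral expressions {constant: coefficient}."""
    return sum(cp * cq * corr(u, v) for u, cp in p.items() for v, cq in q.items())

# Pure umbral equations; each coefficient is the antilogarithm of the printed log.
chi_d = {"b": 10 ** 0.3972701, "m1": -(10 ** 0.2741702), "m2": -(10 ** (-1 + 0.8103180))}
chi_sk = {"m1": -(10 ** 0.3815919), "m2": 10 ** 0.3677012}

norm_d = umbral_product(chi_d, chi_d)      # should be close to 1
norm_sk = umbral_product(chi_sk, chi_sk)   # should be close to 1
r_d_sk = umbral_product(chi_d, chi_sk)     # error correlation of d and skewness

print(round(norm_d, 3), round(norm_sk, 3), round(r_d_sk, 3))
```

The squares of the pure equations come out close to unity, which is a useful check that the signs and characteristics of the logarithms have been read correctly; the large cancelling terms mean only about three figures can be trusted.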



Hence, proceeding to multiply rows and divide columns by the corresponding
standard deviations, we have, after altering the units, the following

Progression Table.

Columns (unit change of):
  (1) One gland in the mean.
  (2) One gland in the range.
  (3) One per cent. in the modal frequency.
  (4) One gland in the standard deviation.
  (5) One gland in the interval from mean to mode.
  (6) 1/10 in the skewness.

Corresponds to probable
change in same units of            (1)        (2)       (3)        (4)        (5)       (6)

Mean . . . . . . . . . . . .       1          0.0002    0.0063    -0.4818     0.0430    0.0210
Range  . . . . . . . . . . .       2.3231     1         1.1651    12.5279    18.3604    3.4992
Modal frequency  . . . . . .      18.3876     0.3389    1        -51.7973    -6.8411   -0.1225
Standard deviation . . . . .      -0.2533     0.0007   -0.0093     1          0.3668    0.0511
Interval from mean to mode .       0.0583     0.0025   -0.0032     0.9459     1         0.1840
Skewness . . . . . . . . . .       0.8156     0.0135   -0.0016     3.7761     5.2706    1
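Each progression entry is a regression coefficient: the error correlation multiplied by the ratio of the two standard deviations, which equals the ratio of the two probable errors. The sketch below reproduces a few entries under that reading. It takes the probable errors from the Probable Error Table, expresses the modal frequency in per cent. and the skewness in tenths (both unit choices follow the column headings), and agrees with the printed figures only to the accuracy of the rounded inputs.

```python
def regression(r, pe_row, pe_col):
    """Probable change in the row character per unit change of the column character."""
    return r * pe_row / pe_col

# Probable errors in the units of the Progression Table.
PE = {
    "mean": 0.0253,
    "range": 2.5362,
    "modal_freq_pct": 1.3679,   # percentage probable error of the modal frequency
    "sd": 0.0183,
    "skew_tenths": 0.0158 * 10, # skewness measured in units of 1/10
}

range_on_mean = regression(0.0232, PE["range"], PE["mean"])
freq_on_range = regression(0.6284, PE["modal_freq_pct"], PE["range"])
sd_on_mean = regression(-0.3493, PE["sd"], PE["mean"])
skew_on_mean = regression(0.1309, PE["skew_tenths"], PE["mean"])

print(round(range_on_mean, 3), round(freq_on_range, 4),
      round(sd_on_mean, 4), round(skew_on_mean, 3))
```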



An examination of this table brings out several interesting features of the
frequency distribution of Müllerian glands in the fore-legs of swine. If a group of
swine were isolated, and found to have a higher mean number of glands, then this
group would most probably have an increased possible range, but at the same time a
decreased variability and a marked increase of skewness. This increase of the
possible range with a decreased variability is especially notable, since the rough-and-ready
class of statistician is very apt to treat the range observed as a measure of
variability; we have here a case in which the same cause, raising of the mean,
produces opposite effects on range and variability. Increase of range, it will next be
observed, produces very little effect on any of the physical constants, but such effect
as there is, is an increase of them all. To increase the modal frequency is to increase
the range and to reduce both the variability and the skewness. Thus the more
mediocre swine there exist in any group, the more nearly their distribution will be
normal. Change in the variability is the cause which on the whole produces most
effect. Increased variability means lowered mean and less mediocrity, but much
increased skewness. Finally, increased skewness denotes probable increase of range,
variability, and mean.

As we have suggested in a previous illustration, the principles of multiple
correlation easily enable us to predict the probable change in a random selection in which
two or more of the characters differ from those of the general population.






(20.) Probable Errors and Error-Correlations of the Constants of the Generalised
Probability Curve of Type

$$y = y_0\,\frac{1}{\{1 + (x/a)^2\}^{m}}\; e^{-\nu\tan^{-1}(x/a)}$$    (cxxii.).

This curve is discussed at length, 'Phil. Trans.,' A, vol. 186, pp. 376-80. The
chief constants are given as follows, if $m = \frac{1}{2}(r + 2)$, $z = \nu^2 + r^2$, and $h$ denote the
origin:

Moments:

$$\mu_2 = \frac{a^2 z}{r^2 (r - 1)}$$    (cxxiii.),

$$\mu_3 = -\frac{4 a^3 \nu z}{r^3 (r - 1)(r - 2)}$$    (cxxiv.),

$$\mu_4 = \frac{3 a^4 z\,\{(r + 6) z - 8 r^2\}}{r^4 (r - 1)(r - 2)(r - 3)}$$    (cxxv.),

Distance of centroid from origin $= a\nu/r$    (cxxvi.),

Size of mean organ $= h - \dfrac{a\nu}{r}$    (cxxvii.),

Size of modal organ $= h - \dfrac{a\nu}{r + 2}$    (cxxviii.),

Distance from mean to mode $= d = \dfrac{2\nu a}{r (r + 2)}$    (cxxix.),

Skewness $= \dfrac{2\nu}{r + 2}\sqrt{\dfrac{r - 1}{r^2 + \nu^2}}$    (cxxx.),

Standard deviation $= \sigma = \dfrac{a}{r}\sqrt{\dfrac{r^2 + \nu^2}{r - 1}}$    (cxxxi.),

where

$$y_0 = \frac{n}{a}\,\frac{e^{\frac{1}{2}\pi\nu}}{G(r, \nu)}$$    (cxxxii.),

$$G(r, \nu) = \int_0^{\pi} \sin^r\theta\; e^{\nu\theta}\, d\theta,$$

and

$$G(r, \nu) = \frac{r (r - 1)}{r^2 + \nu^2}\, G(r - 2, \nu)$$    (cxxxiii.)

is the formula of reduction.



Further, we have the following Bernoulli number series for $G(r, \nu)$, where
$\tan\phi = \nu/r$:

log {e"^'"'' G (r, v)} =■ log A^(27r/r) -f" ('?' + 1-) log cos (j) -^ v(j) 







(25 + I) {2s -f 2)7" 



-^^ (1 — 2^^'"^'^ cos'^'^^ (j) cos 2s -^ 1 (j)) > . (cxxxiv.). 



To find $y_1\,\delta x$, the mean frequency, we have only to put $x = -a\nu/r$ in (cxxii.), and we
have

$\log y_1 = \log y_0 + (r + 2)\log\cos\phi + \nu\phi$
$\qquad = \log n - \log a - \log\{e^{-\frac{1}{2}\pi\nu}\, G(r, \nu)\} + (r + 2)\log\cos\phi + \nu\phi$
$\qquad = \log n - \log a - \log\sqrt{2\pi/r} + \log\cos\phi - \chi$,

where $\chi$ stands for the summation in (cxxxiv.). Hence

$$y_1 = \frac{n}{a}\sqrt{\frac{r}{2\pi}}\;\cos\phi\; e^{-\chi}$$    (cxxxv.),

or

$$y_1 = \frac{n}{\sqrt{2\pi}\,\sigma}\sqrt{\frac{r}{r - 1}}\; e^{-\chi}$$    (cxxxvi.).



As typical constants we require the probable errors of the mean, the standard
deviation, the skewness, and the mean frequency. It is clear that these will require
us first to find $\Sigma_h$, $\Sigma_r$, $\Sigma_\nu$, $\Sigma_a$ and $R_{ha}$, $R_{hr}$, $R_{h\nu}$, $R_{ar}$, $R_{a\nu}$, $R_{r\nu}$.

We shall only indicate briefly the steps towards finding the integrals of the second
differentials of $\log y$.



log y 

cliiogy) 

dx 
^P(log^) 



log ^0 — ^^^ log {1 + {^l^f} ^ '^ tan ^ (o^/a). 



V 



(2mx)la' 



a {1 + i^i^jaf} 1 4- (^^V«)'' ' 



dx 



.2 



2 



^2 



1 



vfcja 



m 






9 



??Z' 



{1 + {xjafY 1 + W^O' (1 + '^''Z^^') 



(2/a^) { — J/ sin 6 cos^ ^ — m cos^ ^ + 2m cos* 6} 



+ -^ fP log y 



».v 



2'?i 



^i^ G (r, z^) 



'7r/2 



e>V I sin6^cos"+'^6""^'^c^^- mG (r+ 2,i^) + 2mG {r + 4,p) \ , 

"7r/2 



whence, remembering that 2m = r + 2, and integrating the first integral by parts, 



we 









— 00 



(Ix^ 



or. 






2n 



*2 



% 

a/ 



\Lj^ + (,. + 2)) ^l^f^^^ - M^' + 2) 



(r + 4) {iJ^ + 0' + 2)-} Gr {r 4- 2, ;;) 
jt;3 + (r -f 4)2 G (n z;) 



^L (^' 4- 1) (r + 2) (r + 4) 

2 "~~~~ 



a 



j,2 ^ ^^. _^ 4y 



G (7^ v) 



• B • I O-A. A A, V i I • /• 



Precisely similar reductions lead us to 






^'13 ^ 






•CO 



dm (Ir 



%v (r + 1) (r + 2) 
7iz; ('/' 4- 1) 



• > • t 0^-A.X. V lilt /j 



(cxxxix.). 



• • • • 



C^14 - 






+ 05 



*' --«5 



r^o? civ 
^^' (log y) T 



11 (r -f 1) (^^ + 2) 
a v^ 4- ("?' 4- 2)^ ' 



^^ / , ^ \ ^^ + 2 (f 4- 4) 



ce 



^2 j^ ^^. ^ 4^^ 









•' —oo 



da dr 



% ir -h (r + 2) 



• • 



«9. 



24 






•^ — 00 



r4- 



c^o. (^z; 



a.: 



33 



2/ ^,.3 ^»^ — "~ 



J — 00 



(^'? 



a 



34 






a 



44 






dr dv 



y~~~d^ 



Ct'Ob 



d) 



>3 



^^ 



(cxL), 



(cxlL), 



» • • « IUA-xII./j 



^i z; (;r 4- 1 ) /^ r" \ 

re z;-^ 4- 0' 4- 2)^ ^ ^' 

n ^ {logG(ni/)} . . . (cxliv.), 



g;:^{%Cf-(n^)} . . . (cxlv,), 



d^ 

^^ ^2 {%G^(n^)} . . . (cxlvi.). 



It will now be needful to find easily calculable series for the second differentials of
$\log G(r, \nu)$. These can be obtained from (cxxxiv.). We find



dy 

d7 



: {log G (n v)} - 



^ + log cos <^ + 

" B 



T 



S 



'2«+l 



(- ly 



(2s + 2) r2 



,2s + 



- [1 - 2'^'+'^ cos'^'+''<^ cos {'Is + 2) ^} . . (cxlvii.), 



dv 



{logG(n v)} 



i 



^ sin c^ cos ^ , 



+ ? ^TT^ ^'--^ cos--^(^ sin (26; + 2) <^, 

2 Q 2 



, (cxlyiii.), 






~, {log G {r, v) 



1 



1.3 



'5 + sin'^c^ (r — 1 — 2 cos'^^) 



00 / 





^H^ {1 - 2^^'^^ cos^^+^(j& cos (25 + 3) ^} |(cxlix,) 



dv' 



; {log G {r, ^)] — — i COB^^ (2 — 1 + 2 Bin'^^>) 



9 



+ S — ~:r^ 2-'-' " cos'^' ■ V^ cos (26- + 3) 4> 



,^.2&H-1 



I 



(cL): 



# 



(Ir civ 



{logG(r,.)} 



i 



sin ^> cos (^ (2 cos'^<^ -- r) 
- S ^-~~-^~~^ 2^^+^cos^^-^^9^ sin (2^ + 3) <j^ 



jlJ-e J > 



These allow of the fairly rapid calculation of $a_{33}$, $a_{34}$, $a_{44}$. The values of the
standard deviations of the errors, and of the error correlations, can then all be
calculated from the determinant






a 



11? 



0.>vy% C^ia* ^^' 



'12 J 



13; 



145 



^H'l'i ^h'li ^hsti ^^24? 

^13? ^^23? ^^33> ^34? 



a 



u> 



^24? ^^349 ^44? 



and its minors. 

1 






xjet Oppf 



1 



a. 



n 



w- and let A' = -t A, then if B«„' be the minor corresponding to 



&py in A', we must work out for any special numerical case 



E 



aJi 



JR 



vh 



R 



av 



^'^ - n A' ' 




n A" 


^2 ^ ■'^s;? 

"''" ~~ 7i A'/ ' 


s^ 


1 B,, 

~ n A' ' 


B,3 


11* 


Bis 


^/(BjiBaa)' 


~ x/(B„ Bes)' 


Bi, 


R«r 


B23 


\/(Bii B20)' 


~ s/i^n Bss)' 


B,, 


T? 


Bgj 



x/(B,3B«)' 



'TV 



a/CBss B,,y 



^ 



* As in the former case, these were all developed, but the extreme length of the resulting formulae
gives them no advantage over working in any special case with the numerical determinants.






In general none of these correlations vanish, and their values must all be found
before the errors and correlations of the chief characteristics of the frequency can be
found.

The following results, easily obtained by aid of the relation $\tan\phi = \nu/r$, will be of
service:



^2 



—j^ {t; + tan'-^ (f>S;. — 2 tan <^S^S3r/} 



• » » 5 



(clii.), 



R 



R 






'^a 



R. 



'<ph 



COS^ ^ 



■ i V. 



XT* f ^y 



•'(ji 



tan (jy^yiXfy^ c 



t • « * « 



-~— - I ^v^JLv^.j, """" t}an Y^^^,j 



e » • • » • » 



COS^ ^ 



* 






tan ut^piAf^fj. i 



s 






/m 



tan (}>t,Mj,,.] 



» * » ff * 



/ 1 * •'» \ 

♦- »• I \jX lilt / < 



. . (cltv.). 



e » * • • t • 



(clv,). 



f . (civi). 



By (cxxvii.) a.nd (cxxxi.) if x be the mean size of organ. 



xxe* jOv> 



Ax 

Act 

(T 



a 



All — tan (hAa — ~^~~- Ad>, 

^ COS" ^ - 



ft 



:r 



+ tan(j&A^~- i^_-j 



a'^ 



S^j — S/i + tan" ^S^^ -f" _^^4. I -S^ """ 2 tan cpS/iSaR/t^, 



cos'^ <^ 



2a 



a 



-f- ^) "tan ® " <, " ^a,-*^A ttfttfe • • • . . . 



» t 



• * • 



(civil,), 



Cvllvl 



y ,2 V2 



(T 



Of 



1,2 
+ tan" <pX,^ + 47^. _ J\2 "^^ "^~ T^ ^"^^^ ^Sa^<^R«^ 



1 






1 



an <p —----— Zi^24f\%^r^ . . • , , . . 



• • 



a{r~V) 

. . . , (cLviil), 



a 






'cr*^A' 



^ "co?(S" to^^ Y^-S/iS^xb/j^ — tan <pS«S^Ba^ 



a 



a 



1 



cos«"^" ^ "~ 2^-^l) 



'^^^'^»'-^^^»' + 2(r — ^x SaS3ar+ 2^'g3-T7^ S,.S<|,B.,^ . (clix.). 



From (cxxx.) we have, if S^. == skewness, 



S'^=!fT^/('^-i)- 




Or, taking logaidthmic difterentials, 

ASJBj, = cot (A A(^ — j -— ;:^ -^ -1 ^^^.-:™-^- ) Ar^ 

whence, 

tIM = cot' cj^ + ^^L^_^^^^-^^^ V2 _ eot (^ ^:zr^^^^ t^t,l\^^ . (clx.). ^ 

In a similar manner $R_{\bar{x}\,Sk}$ and $R_{\sigma\,Sk}$ can be found if desired. None of these quantities
will, as a rule, vanish, and as very many measurements on animals give curves of
the tangent type, we conclude that in general all selection of the size of an organ
alters its variability and the skewness of its distribution, and again all selection of
variability connotes alteration of the size and skewness of the selected organ.

The probable errors of $\mu_2$, $\mu_3$, and $\mu_4$, as well as the error-correlations of these
quantities, can all be found from the differentials of (cxxiii.), (cxxiv.), and (cxxv.);
the calculation is laborious, but presents no novelty.

Lastly, the probable errors of the mean and modal frequencies may be deduced.

For the mean frequency we start from (cxxxv.) and use (cxxxiv.).

This requires us to know $\Delta\chi$, where

j.= S ;^™J^:^^^ (1 ^ 2^^"'-^ cos^^"^-^ 6 cos (2^? + 1) (A) . . (clxi.). 

^ (^s + I) (2s 4- 2)H^^+i ^ ^ \ I / T/ \ / 

We have, as in (cxlvdi.) and (cxlviii.), 



A^ CO / 1 \s T\ 

A^ =r ^ ^S 7V-~H™^ (1 - 2''^' cos'^-^-^^^^^> cos (2s + 2) 6) 
+ — S ^^^^\ 2^-+'' cos'^^-'-'^ S sin (26- + 2) 6 
= — Ci Ar/r + O2 Av/r, 



where Ci and Co admit of fairly easy calculation. 
Hence, by (cxxxv.), we find, 

Ayj/yi =: — Aa/a + (|- + sin'^ ^ + <?i) ^^"/'^ ^ {<^2 + cos ^ sin ^) Av/r. 
^M =^/<^ + (i + sii^' ^ + ^1)' S'/^'' + i^^^ + <^os ^ sin ^)^" S:Vr^ 



?u ^* ' ' a/r 



_ 2 (I + sin^ ^ + gj) (% + cos ^ sin <|?) ^ ^ -r^ / 1 '• '\ 

If the problem be to find the modal frequency y2 S^? we easily deduce y^ by putting 






J 



X =z — ™~ in the equation to the curve. Writing tan ^' = vl{r + 2) and )( the 

ct \^r ~Y' Zt) 

same function of r + 2 that x is of r, we have 



V: 






Since -j--~-^ is greater than y^r, and <!>' and x' ^i^^ kss than ^ and ^ respectively, 
it follows that y^ is greater than 3/1, as it should be. Further we find 

— = + i 77 + sni' (i + Ci — 77 — (^2 + cos 6 sm <t> ) - — ~ . (clxiv.). 

Here $c_1'$ and $c_2'$ are the same functions of $r + 2$ and $\phi'$ that $c_1$ and $c_2$ are of $r$ and $\phi$.
The usual process of squaring and introducing the standard deviations into the square
terms and the product of standard deviations and correlations into the product terms
will give us $\Sigma_{y_2}$.

(21.) Illustration: Stature of Children.

In order to illustrate the difficulties which may arise in determining the probable
errors of the constants and the error correlations, we have selected for this illustration
not a curve markedly skew, but one which is extremely nearly normal. The problem
in this case is accordingly the following one: Are the values of the constants obtained
for the distribution and distinguishing it from a normal distribution really significant?
The difficulties which arise in the course of the arithmetical work depend upon the
fact that, as the distribution is nearly normal, its constants approach the values at
which the type of the skew curve passes over into the normal curve, and consequently
not only will their probable errors be large, but, as in all cases of approach
to limits, they will depend upon expressions tending to become indeterminate. Thus
in the evaluation of the determinant $\Delta$ and its minors, we at once found our results
depended on the ratio of the differences of very small quantities. We were accordingly
in this case obliged to calculate our constituents to a degree of accuracy which
will, in general, be quite unnecessary, and which was only possible and straightforward
owing to the ready help of a large sized Brunsviga. That the method, even
in a critical case of this kind, gives correct results is evidenced by the agreement of
our values of the constants with those (probable errors of mean and standard
deviation) which can be readily calculated by other processes.

The example we have selected is that given for the stature of 2192 St. Louis
school girls of 8 years of age in 'Phil. Trans.,' A, vol. 186, p. 386.

The equation to the frequency curve is

$x = 14.9917\,\tan\theta$,

$y = 235.323\,\cos^{32.8023}\theta\; e^{-4.56967\,\theta}$,

the axis of $x$ being positive towards dwarfs and the origin 2.2241 on the positive side
of the mean. The unit of $x$ is 2 cms. of height, and all the constants except the
mean height are given in two-centimetre units.

We have the following values of the constants:

Mean height = 118.271 centims.,
$\sigma = 2.77622$,
$y_2$ = modal frequency = 324.18,
$y_1$ = mean frequency = 323.76,
Sk. = skewness = 0.04885,
$m = 16.4011$,
$a = 14.9917$.

It will be seen at once that the skewness is small, that the mean and mode are
close together, and the mean and modal frequencies are almost identical. Our
problem is: Are these differences significant or not?

Let $n = 2192$, the total population; then the values of $a$, $r$, $\nu$ given above were
assumed to be absolutely exact, and $\Delta$ calculated with its constituents to 9 places
of figures, as it depends on the differences of very small quantities. We shall
indicate one or two stages in the arithmetical work.



$\mu_2 = 7.70739$,
$\mu_3 = -2.38064$,
$\mu_4 = 192.17419$,
$d = 0.135606$,
$r = 30.8023$,
$\nu = 4.56967$,




$$\frac{\Delta}{n^4} = \begin{vmatrix}
 0.131108064 &  0.017214971 & -0.008837638 &  0.063438906 \\
 0.017214971 &  0.010392042 & -0.003264669 &  0.008837638 \\
-0.008837638 & -0.003264669 &  0.001144775 & -0.004422657 \\
 0.063438906 &  0.008837638 & -0.004422657 &  0.030799043
\end{vmatrix}$$



The evaluation of this determinant and its minors was then carried out by means
of the Brunsviga, and we found

$\Delta/n^4 = 0.104824472/10^{12}$,

$\Delta_{11}/n^3 = 670.195695496/10^{12}$,     $\Delta_{12}/n^3 = 1200.842528/10^{12}$,
$\Delta_{22}/n^3 = 4606.123523/10^{12}$,       $\Delta_{13}/n^3 = 5059.387378/10^{12}$,
$\Delta_{33}/n^3 = 76025.131845/10^{12}$,      $\Delta_{14}/n^3 = 1762.570609/10^{12}$,
$\Delta_{44}/n^3 = 4828.382384/10^{12}$,       $\Delta_{23}/n^3 = 18675.261289/10^{12}$,
                                               $\Delta_{24}/n^3 = 3833.460555/10^{12}$,
                                               $\Delta_{34}/n^3 = 15979.332581/10^{12}$.




We can now, remembering that

$\Sigma_p = \sqrt{\Delta_{pp}/\Delta}$, and $R_{pq} = \Delta_{pq}/\sqrt{\Delta_{pp}\Delta_{qq}}$,

write down the standard deviations and error correlations of the algebraical constants:

$\Sigma_h = 1.7078$,
$\Sigma_a = 4.4773$,
$\Sigma_r = 18.1898$,
$\Sigma_\nu = 4.5840$,

$R_{ha} = -0.6835$,
$R_{hr} = -0.7088$,
$R_{h\nu} = -0.9798$,
$R_{ar} = 0.9980$,
$R_{a\nu} = 0.8129$,
$R_{r\nu} = 0.8340$.
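The passage from the determinant and its minors to the standard deviations and correlations is a short calculation. The sketch below is an illustration under stated assumptions: the minors are entered as magnitudes, all on the common scale $10^{-12}$ (the printed powers of ten are illegible in this copy, so the scale is an assumption, though it cancels in every ratio), and the order of the constants is taken to be $h$, $a$, $r$, $\nu$. Only the sizes of the correlations are computed; their signs follow from the signs of the cofactors.

```python
import math

N = 2192  # total population

# Delta / n^4 and the minors Delta_pq / n^3, all in units of 1e-12 (assumed common scale).
DET = 0.104824472
MINORS = {
    ("h", "h"): 670.195695496, ("a", "a"): 4606.123523,
    ("r", "r"): 76025.131845,  ("nu", "nu"): 4828.382384,
    ("h", "a"): 1200.842528,   ("h", "r"): 5059.387378,
    ("h", "nu"): 1762.570609,  ("a", "r"): 18675.261289,
    ("a", "nu"): 3833.460555,  ("r", "nu"): 15979.332581,
}

def sd(p):
    """Sigma_p = sqrt(Delta_pp / Delta); one factor of n survives the scalings."""
    return math.sqrt(MINORS[(p, p)] / (N * DET))

def abs_corr(p, q):
    """|R_pq| = Delta_pq / sqrt(Delta_pp * Delta_qq), using minor magnitudes."""
    return MINORS[(p, q)] / math.sqrt(MINORS[(p, p)] * MINORS[(q, q)])

sds = {p: sd(p) for p in ("h", "a", "r", "nu")}
print({p: round(v, 4) for p, v in sds.items()})
print(round(abs_corr("h", "a"), 4), round(abs_corr("a", "r"), 4))
```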



Here $h$ marks the position of the origin of the curve, and the numerical values are
only retained to four places of figures, although, of course, in the further calculations
the logarithms of the full values of the $\Sigma$'s and $R$'s have been used.

It will be noticed at once that though $a$, $r$, and $\nu$ have very considerable probable
errors, the correlation between them is very high. In other words, as the curve
approaches its limiting shape, $a$, $r$, and $\nu$ may vary very considerably, but owing to
their close correlation this will not sensibly affect the geometrical shape of the
curve.

The next stage was to determine the standard deviations and error correlations
of certain subsidiary constants. Here, as in the determination later of the like
quantities for the "physical" characters, we found the umbral notation of great
service. It consists, as we have seen, in writing down a difference equation between
any constants, and then replacing the differences $\delta u$ by $\Sigma_u\chi_u$, $\delta v$ by $\Sigma_v\chi_v$, &c., where
$\chi_u$, $\chi_v$, &c., are quantities which obey the relations $\chi_u^2 = 1$, $\chi_v^2 = 1$, $\chi_u\chi_v = R_{uv}$. Thus,
if $\tan\phi = \nu/r$, we find for the umbral equation

$\Sigma_\phi\chi_\phi = \dfrac{\cos^2\phi}{r}\,\Sigma_\nu\chi_\nu - \dfrac{\sin\phi\cos\phi}{r}\,\Sigma_r\chi_r$.

Whence, putting in the numerical values, we have

$\Sigma_\phi\chi_\phi = \text{Antl. } \bar{1}.1632115\,\chi_\nu - \text{Antl. } \bar{2}.9330706\,\chi_r$,

where Antl. stands for antilogarithm, in which form we found it easiest to keep the
umbral coefficients. The square of this result gave at once $\Sigma_\phi$,
and dividing out by its logarithm, we have the pure umbral equation

$\chi_\phi = \text{Antl. } 0.2190937\,\chi_\nu - \text{Antl. } \bar{1}.9889728\,\chi_r$.




Our object was then to find such pure umbral equations connecting all the
"physical" constants with the algebraic constants. Their products will then give
the error correlations of all the "physical" constants in terms of the correlations
already known between the algebraical constants.

For example, multiplying the above equation for $\chi_\phi$ by $\chi_h$, $\chi_a$, $\chi_r$, $\chi_\nu$ we have, since
$\chi_a\chi_\nu = R_{a\nu}$, $\chi_a\chi_r = R_{ar}$, &c., are already known,

$R_{h\phi} = -0.9317$, actually $\log(-R_{h\phi}) = \bar{1}.9692668$,
$R_{a\phi} = 0.3733$, actually $\log R_{a\phi} = \bar{1}.5720115$,
$R_{r\phi} = 0.4063$, actually $\log R_{r\phi} = \bar{1}.6088746$,
$R_{\nu\phi} = 0.8430$, actually $\log R_{\nu\phi} = \bar{1}.9258379$.

It was these logarithms, of course, which were used in the further calculations.
Since $h$ is measured negatively (i.e., towards dwarfs, $x$ is positive), we must write
for transferring origin to the mean

$x' = x + a\tan\phi$,

where $a\tan\phi$ is the distance between the old origin and the mean, or if $m$ be used
to represent the mean we have

$m = h + a\tan\phi$.

Hence we find the umbral equation

$\Sigma_m\chi_m = \text{Antl. } 0.2324493\,\chi_h + \text{Antl. } \bar{1}.8223179\,\chi_a + \text{Antl. } 0.1294233\,\chi_\phi$.

Hence we determine

$\Sigma_m = 0.0549$,

and the pure umbral equation

$\chi_m = \text{Antl. } 1.4925897\,\chi_h + \text{Antl. } 1.0824583\,\chi_a + \text{Antl. } 1.3895637\,\chi_\phi$.

In precisely the same way all the other "physical" constants, i.e., the standard
deviation, $\sigma$, the mean frequency, $y_1$, the distance between mean and mode, $d$, and
the skewness, Sk., were found, and the umbral equations investigated. It is only
necessary here to give the results.




Constant.                          Probable error.    Percentage probable error.

Algebraic constants:
  h, position of origin                1.1519
  a                                    3.0199               20.1438
  r                                   12.2688               39.8309
  ν                                    3.0919               67.6612

"Physical" constants:
  Position of mean                     0.03705
  Position of mode                     0.05950
  Mean frequency, y_1                  4.4362                1.3703
  Standard deviation, σ                0.02984               1.0750
  Mean to mode, d                      0.0497               36.6420
  Skewness, Sk.                        0.02661              54.4690






Now it will be seen at once that the probable errors in the algebraic constants are
large, but that the probable errors in the position of the mean, of the mode, and in
the magnitudes of the mean frequency and standard deviation are small. The position
of the mean is sensibly more correct than that of the mode. On the other hand, the
distance of the mean from the mode and the skewness have large probable errors, not,
however, so large but what these quantities are probably significant. The frequency
distribution probably differs significantly from the normal distribution, but the
difference is small and would require a very large number of observations to determine
it with extreme accuracy. That there is a significant divergence from normality
is also indicated by the sensible difference between the percentage errors in $y_1$ and $\sigma$,
which would be equal for a normal distribution. Had we taken a normal distribution,
the probable error of the mean would have been 0.0400, and of the standard deviation,
0.02831. In fact, the standard deviation of the standard deviation, if calculated for
the normal curve = 0.04197, if calculated by our present method = 0.044246, and if
calculated by a modified form of the fourth moment formula given by Czuber*
= 0.044240. This shows that the arithmetic of our process has been substantially
correct.

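We now place together the umbral equations for the correlations of the errors in
the "physical" constants. They are

$\chi_m = \text{Antl. } 1.4925897\,\chi_h + \text{Antl. } 1.0824583\,\chi_a + \text{Antl. } 1.3895637\,\chi_\phi$,
$\chi_\sigma = \text{Antl. } 1.2727484\,\chi_a - \text{Antl. } 1.2821306\,\chi_r + \text{Antl. } \bar{1}.9130028\,\chi_\phi$,
$\chi_{y_1} = -\text{Antl. } \bar{1}.8166966\,\chi_\phi - \text{Antl. } 1.1673480\,\chi_a + \text{Antl. } 1.1557688\,\chi_r$,
$\chi_d = \text{Antl. } 0.2663616\,\chi_\nu + \text{Antl. } \bar{1}.7401625\,\chi_a - \text{Antl. } 0.3238256\,\chi_r$,
$\chi_{Sk} = \text{Antl. } \bar{1}.8656420\,\chi_\phi - \text{Antl. } 0.0170466\,\chi_r$.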



From these results any correlation between pairs of errors, "physical" or algebraic,
can be found at once. The following table gives the chief results:

Correlation Coefficients between Errors in Constants.

            m.         σ.         y_1.       d.         Sk.

m           1          0.0772    -0.0584     0.0826     0.0426
σ           0.0772     1         -0.7062     0.1177     0.1431
y_1        -0.0584    -0.7062     1          0.1779     0.4086
d           0.0826     0.1177     0.1779     1          0.6843
Sk.         0.0426     0.1431     0.4086     0.6843     1



* 'Theorie der Beobachtungsfehler,' p. 133.



Now it is clear that although the curve is nearly normal, there is still sensible
correlation between quantities (e.g., the mean and $\sigma$ or $d$) which would have no
correlation between them if the curve were absolutely normal. This will be clearer if,
as in the previous illustration, we replace this table by a table of regression
coefficients.

            m.         y_1.        σ.          d.         Sk.

m           1         -0.0005      0.0959      0.0616     0.05935
y_1        -6.9883     1        -104.9745     15.8875    68.1271
σ           0.0622    -0.00475     1           0.0707     0.16045
d           0.1108     0.0020      0.1960      1          1.2780
Sk.         0.0306     0.00245     0.12755     0.3665     1



This table has now finally to be thrown into more suitable units and attention
paid to the fact that $m$ increases towards dwarfs. We have, after the proper changes,
the following results:*

Progression Table.

Columns (unit change of):
  (1) One centim. in mean stature.
  (2) One child per hundred in frequency of mean stature.
  (3) One centim. in standard deviation.
  (4) One centim. in interval from mean to mode.
  (5) 1/10 in the skewness.

Corresponds to probable
changes in the same units of        (1)        (2)        (3)        (4)        (5)

Mean stature . . . . . . . .        1          0.0032    -0.0959    -0.0616    -0.0119
Mean frequency . . . . . . .        1.0798     1        -16.1745     2.4536     2.1042
Standard deviation . . . . .       -0.0622    -0.0308     1          0.0707     0.0321
Interval from mean to mode .       -0.1108     0.0129     0.1960     1          0.2556
Skewness . . . . . . . . . .       -0.1530     0.0794     0.6378     1.8333     1
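The change of units between the regression table and the Progression Table can be sketched as follows. The scale factors are assumptions read off the text: lengths are recorded in two-centimetre units, the frequency is expressed per hundred of the mean frequency 323.76, the skewness in tenths, and stature is the negative of $m$ because $m$ increases towards dwarfs. The negative sign given to the regression of $y_1$ on $m$ is likewise an assumption, taken from the sign of the corresponding correlation.

```python
MEAN_FREQUENCY = 323.76

# New units per old unit for each character; the sign on "m" turns m into stature.
SCALE = {
    "m": -2.0,                      # 2 cm per unit, and stature runs opposite to m
    "y": 100.0 / MEAN_FREQUENCY,    # children per hundred of the mean frequency
    "sigma": 2.0,                   # centimetres
    "d": 2.0,                       # centimetres
    "sk": 10.0,                     # tenths of the skewness
}

def progression(regression, row, col):
    """Re-express a regression of `row` on `col` in the Progression Table units."""
    return regression * SCALE[row] / SCALE[col]

freq_per_cm_stature = progression(-6.9883, "y", "m")
sd_per_cm_stature = progression(0.0622, "sigma", "m")
skew_per_cm_stature = progression(0.0306, "sk", "m")
skew_per_cm_d = progression(0.3665, "sk", "d")
d_per_tenth_skew = progression(1.2780, "d", "sk")

print(round(freq_per_cm_stature, 4), round(sd_per_cm_stature, 4),
      round(skew_per_cm_stature, 4), round(skew_per_cm_d, 4),
      round(d_per_tenth_skew, 4))
```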



This table is extremely suggestive. It shows us that a random selection of girls
of eight which had an increase of stature would have a less standard deviation, less
distance between the mode and mean and less skewness. In other words, a selection
giving taller children would be less variable and more nearly normal. Now as
children grow older their stature increases, is less variable, and is more normal in its
distribution. Thus, a selection of taller children from among children of eight
would broadly tend to reproduce the characters of the stature distribution of older
children. In the same manner a selection of shorter children is more variable and
less normal than the distribution of the general population of eight years of age, i.e.,
tends to reproduce the characteristics of a younger population. Generally, a random
selection, which increases variability, very sensibly increases skewness and decreases
stature. What, perhaps, would hardly be expected, is that increase of skewness as
well as increase of interval from mean to mode, i.e., greater divergence from normality,
increases the frequency of the mean stature.

It will be clear that by aid of this table we are able to predict the probable
changes in all the other physical characters of the distribution when any sub-class
has been selected at random from the general population with a difference of one
character. If two or more characters differ in the sub-class, the probable changes
in the other characters can be found by the principles of multiple correlation from
the correlation table on page 307.

(22.) Conclusion. 

This study of the probable errors and error correlations shows us that these 
quantities can be determined for the most complex system of organs in the case 
of normal correlation, and in the case of either normal or skew variation with con- 
siderable ease. It is only in the case of skew variation that the arithmetic becomes 
at all laborious. But numerical examples suffice to show that the errors here 
made are of the same order as in the case of normal variation, if we confine our 
attention to the characteristic features of the frequency, e.g., the mean or modal
frequency, the standard deviation, the skewness, &c. Certain constants of the 
algebraic form of the frequency curves have large probable errors, but these errors 
are so highly correlated, that their existence does not suffice to substantially modify 
either the form of the curve, or the '' physical " characteristics of the distribution 
calculated from such values. 

For the theory of evolution certain very important principles flow, beyond the 
mere advantage of knowing the probable errors made in the measurement of racial 
or organic characters. Above all we note the importance of a random selection in 
altering in a systematic manner all racial constants. In most cases even size cannot 
be altered without alteration of the size, variation and correlation of all correlated 
organs. This principle is developed more at length in a memoir, nearly completed, 
on the influence of directed selection, which covers as a special case that of random 
selection. 

Later, we hope to apply the general theorem from which our memoir starts to 
determine the probable errors in the constants of the components into which 
a heterogeneous frequency distribution may be resolved by the method of the first 
memoir of this series.* It applies equally to such an investigation. 

* The importance of such a determination was emphasized by Professor George Darwin in the 
discussion which took place at the reading of that memoir. 




[Note.— Added May 25, 1898. One point ought to have been more fully dealt with 
in the above memoir, namely, the probable error of the criterion $\kappa = 6 + 3\beta_1 - 2\beta_2$, 
upon which the selection of the type of the frequency depends. Clearly, if the 
probable error of this criterion is as large as the criterion itself, there can be no 
stability of type, or the frequency may change over from one type to another. 

On page 289 we have found the standard deviation of the criterion in terms of 
known quantities for the curve 

It is in fact given by the umbral equation 

where $l_1$ and $l_2$ are functions of $m_1$ and $m_2$ given in (cxx.), and $\chi_{m_1}\chi_{m_2} = R_{m_1 m_2}$ is known 
from (cvi.). 

The standard deviation of the criterion for the curve of type 

may be found by taking differentials of 

$$\kappa = 6 + 3\beta_1 - 2\beta_2 = -\frac{12}{r-3}\left\{4\sin^2\phi\,\frac{r-1}{(r-2)^2} + 1\right\},$$

a value readily obtainable from 'Phil. Trans.,' A, vol. 186, p. 377. We thus find 
the umbral equation 

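$$\Sigma_\kappa\chi_\kappa = \left\{96\sin^2\phi\,\frac{r^2-3r+1}{(r-2)^3(r-3)^2} + \frac{12}{(r-3)^2}\right\}\Sigma_r\chi_r - \frac{96\sin\phi\cos\phi\,(r-1)}{(r-2)^2(r-3)}\,\Sigma_\phi\chi_\phi \qquad \text{(clxvii.)}$$

$$= l_1\Sigma_r\chi_r + l_2\Sigma_\phi\chi_\phi, \text{ say,}$$

where $\Sigma_r$, $\Sigma_\phi$ and $\chi_r\chi_\phi = R_{r\phi}$ are given by [illegible] and (cliv.). 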
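
The differentiation step can be checked numerically. The following is a minimal sketch, not the authors' own computation: it assumes the criterion for this curve is $\kappa = -\frac{12}{r-3}\{4\sin^2\phi\,(r-1)/(r-2)^2 + 1\}$, and it reads the umbral product $\chi_r\chi_\phi$ as the correlation between the errors in $r$ and $\phi$, so that squaring the umbral equation gives the usual propagation-of-error formula. The function names and the arguments `sd_r`, `sd_phi` and `corr_r_phi` are hypothetical labels introduced for illustration only.

```python
import math


def kappa_type_iv(r, phi):
    """Criterion kappa = 6 + 3*beta1 - 2*beta2 written in terms of r and phi."""
    bracket = 4.0 * math.sin(phi) ** 2 * (r - 1.0) / (r - 2.0) ** 2 + 1.0
    return -12.0 / (r - 3.0) * bracket


def kappa_gradient(r, phi):
    """Partial derivatives of kappa with respect to r and phi.

    These are the two coefficients multiplying the r and phi terms
    of the umbral equation.
    """
    s, c = math.sin(phi), math.cos(phi)
    d_r = (96.0 * s * s * (r * r - 3.0 * r + 1.0)
           / ((r - 2.0) ** 3 * (r - 3.0) ** 2)
           + 12.0 / (r - 3.0) ** 2)
    d_phi = -96.0 * s * c * (r - 1.0) / ((r - 2.0) ** 2 * (r - 3.0))
    return d_r, d_phi


def sd_kappa(r, phi, sd_r, sd_phi, corr_r_phi):
    """Standard deviation of kappa from those of r and phi.

    Assumption: squaring the umbral equation, with chi^2 = 1 and
    chi_r * chi_phi equal to the error correlation, gives
    var = a^2 sd_r^2 + b^2 sd_phi^2 + 2 a b sd_r sd_phi corr.
    """
    a, b = kappa_gradient(r, phi)
    variance = ((a * sd_r) ** 2 + (b * sd_phi) ** 2
                + 2.0 * a * b * sd_r * sd_phi * corr_r_phi)
    return math.sqrt(variance)
```

Comparing `kappa_gradient` with a central finite difference of `kappa_type_iv` at any trial point with $r > 3$ is a quick way to confirm the two coefficients.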
Applying these results to the numerical examples, we find : — 

(a.) For the glands of swine 

$$\Sigma_\kappa\chi_\kappa = -.070{,}5459\,\Sigma_{m_1}\chi_{m_1} - .038{,}4629\,\Sigma_{m_2}\chi_{m_2},$$

whence the probable error of $\kappa = .67449\,\Sigma_\kappa = .1012$; or, 

$$\kappa = .6559 \pm .1012.$$

(b.) For the stature of children 

$$\Sigma_\kappa\chi_\kappa = .015{,}6206\,\Sigma_r\chi_r - .018{,}0067\,\Sigma_\phi\chi_\phi,$$

whence the probable error of $\kappa = .67449\,\Sigma_\kappa = .1919$; or, 

$$\kappa = -.4330 \pm .1919.$$




In both cases, therefore, we may consider that the sign of $\kappa$ is beyond question, or 
that the type selected is really a significant character of the frequency. 
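
As a rough illustration of this comparison, the sketch below expresses each criterion as a multiple of its probable error, using the two pairs of values quoted above. The factor .67449 is taken to be the upper quartile of the standard normal distribution, which is the usual meaning of the probable-error multiplier. Treating the ratio as a measure of how far the sign is "beyond question" is an interpretive assumption, and the helper names are hypothetical.

```python
from statistics import NormalDist

# Probable error = this factor times the standard deviation (approx. 0.67449).
PROBABLE_ERROR_FACTOR = NormalDist().inv_cdf(0.75)


def probable_error(standard_deviation):
    """Probable error corresponding to a given standard deviation."""
    return PROBABLE_ERROR_FACTOR * standard_deviation


def ratio_to_probable_error(criterion, prob_error):
    """Size of the criterion measured in units of its probable error."""
    return abs(criterion) / prob_error


# (criterion, probable error) as quoted in the note.
examples = {
    "glands of swine": (0.6559, 0.1012),
    "stature of children": (-0.4330, 0.1919),
}

for name, (criterion, prob_error) in examples.items():
    ratio = ratio_to_probable_error(criterion, prob_error)
    print(f"{name}: criterion is {ratio:.2f} probable errors from zero")
```

A ratio near or below unity would correspond to the case described at the start of the note, in which the probable error is as large as the criterion itself.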

With regard to the probable error made in estimating a criterion to be zero, and 
using a curve of type 

we must remark that assuming the criterion to be zero is equivalent to assuming 
that its probable error is zero. Accordingly the only satisfactory method of testing 
whether a curve really falls under this type is to work out the probable error of its 
criterion on the hypothesis that it belongs to one or other of the two types, with 
positive or negative criterion as the case may be. If the probable error of the 
criterion thus calculated is sensibly as large as the criterion itself, then we may 
assume that the frequency distribution is of the type 



